PCT 



4 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Interoationt) Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 

C1ZN 15/12, C07K 14/47, A61K 3S/17, 
G01N 33753, C12Q 1/68, C12N 15/62, 
C07K 16/18 



A2 



(11) International Publication Number: WO 00/53758 

(43) Internationa} Publication Date: 1 4 September 2000 (14.09.00) 



(21) International Application 

(22) International Filing Date: 



PCT/US00/O5&41 
2 March 2000 (02J03.00) 



(30) Priority Data: 

PCT/US99/05028 

60/123,618 

60/12X957 

60/l2S,77S 

60/128^49 

PCT/US99/08615 

60/131.445 

60/132^71 

60/13437 

PCT/US99/122S2 

60/141,037 

60/144.758 

60/145,698 

60/146,222 

PCI7US99/20U1 

PCT/US99/20594 

PCI7US99/20944 

PCT/US99/21090 

PCT/US99/21547 

PCIYUS99V23Q89 

60/162,506 

PCT/US99/28214 

PCIVUS99/283I3 

PCI7US99/28409 

PCT/US99/28301 

PC77US99/28634 

PCT/US99/2E551 

PCI7US99/28564 

PCT/US99/28565 

PCT/US99/30095 

PCT/US99/30999 

PCT/US99/31274 

PCT/USOO/00219 

PCT/US00/0O277 

PCT/USOO/00376 

PCr/USOO/03565 

pcr/usocyo434i 

PCT/USOG/04342 
PCI7US0CVD4414 



8 March 1999 (080)3.99) US 

10 March 1999(1 0.03.99) US 
12 March 1999 (12.03.99) US 
23 March 1999 (2343.99) US 

12 April 1999 (124*99) US 
20 April 1999 (2044^9) US 
28 April 1999 (2844.99) US 

4 May 1999(0445.99) US 

14 May 1999(1445.99) US 
2 June 1999(0246.99) US 
23 June 1999(2346.99) US 
20 July 1999(2047.99) US 
26 July 1999(26.07.99) US 

28 July 1999(2847.99) US 
I September 1999 (01 49.99) US 
8 September 1999 (0849.99) US 

13 September 1999 (1349.99) US 

15 September 1999 (1549.99) US 

15 September 1999 (1549.99) US 

5 October 1999 (05.10.99) US 

29 October 1999 (29.10.99) US 

29 November 1999 (29.1 1.99) US 

30 November 1999 (30.1 1 .99) US 
30 November 1999(30.11 .99) US 
1 December 1999 (01.IZ99) US 

1 December 1999 (01.12.99) US 

2 December 1999 (02.12.99) US 
2 December 1999 (02.12.99) US 
2 December 1999 (02.12.99) US 

16 December 1999 (16.12.99) US 
20 December 1999 (20.12.99) US 
30 December 1 999 (30.1 2.99) US 

5 January 2000 (05.01 .00) US 

6 January 2000 (0641 40) US 
6 January 2000 (064140) US 

11 February 2000 (1142.00) US 
18 February 2000 (184240) US 
18 February 2000 (184240) US 
22 February 2000 (224240) US 



(71) Applicant (for all designated Stales except US): GENENTECH, 

INC [US/US]; I DNA Way, South San Francisco, CA 
94080-4990 (US). 

(72) Inventors; and 

(75) Invented Applicants (for US only): ASHKENAZI, Avi, J. 
[US/US* 1456 Tarrytown Soeet, San Mateo, CA 94402 
(US). BAKER. Kevin, P. {GB/US]; 14006 Indian Run 
Drive, Damestown, MD 20878 (US). GODDARD, Audrey 
[CA/US); 110 Congo Street. San Francisco, CA 94131 
(US). GURNEY, Austin, L. fUS/US); 1 Debbie Lane, 
Belmont, CA 94002 (USX HEBERT. Caroline [US/US J; 
1809 Vine Street, Berkeiey, CA 94703 (US). HENZEL, 
William [US/US]; 3724 Soutbwood Drive, San Mateo, 
CA 94030 (US). KABAKOFF, Rhona, C. (BR/USfc 1084 
Granada Drive, Pacifica, CA 94044 (US). LU, Yanmei 
[CNAJS]; 1001 Continentals Way #206, Belmont. CA 94002 
(US). PAN, James [CA/US); 2705 Coronet Boulevard, 
Belmont. CA 94002 (US). PENNICA. Diane fUS/US); 
2417 Hale Drive, Buriingame. CA 94010 (US). S HELTON, 
David, L. [US/USJ 5845 Clover Drive, Oakland, CA 
94618 (US). SMITH, Victoria IAU/US); 19 Dwight Road, 
Buriingame, CA 94010 (US). STEWART, Timothy, A. 
fUS/USJ; 465 Douglass Street, San Francisco, CA 941 14 
(US). TUMAS, Daniel (US/US): 3 Rae Court, Orinda, 
CA 94563 (US). WATANABE, Colin, K. [US/US]; 128 
Corliss Drive, Moraga, CA 94556 (US). WOOD, William, 
I. [US/US]; 35 Southdown Court, Hillsborough, CA 94010 
(US). YAN, Minhong [CN/US]; 1910 Garden Drive #114, 
Buriingame, CA 94010 (US). 

(74) Agents: SVOBODA, Craig, G. et al^ Genentech, Inc^ 1 DNA 
Way, South San Francisco, CA 94080-4990 (US). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH. CN, CR, CU. CZ, DE, DK, DM. EE, 
ES, Fl, GB, GD, GE. GH, GM, HR. HU, ID, IL, IN, IS, JP, 
KE, KG, KP. KR. KZ. LC, LK, LR, LS, LT, LU. LV, MA, 
MD, MG, MK, MN. MW, MX, NO. NZ, PL, PT, RO. RU, 
SD, SE, SG, SI SK, SL, 17, TM, TR, TT, TZ, UA, UG, 
US, UZ, VN, YU. ZA. ZW, ARIPO patent (GH, GM, KE, 
LS, MW, SD. SL, SZ. TZ, UG, ZW), Eurasian patent (AM, 
AZ, BY, KG, KZ, MD. RU, 77, TM), European patent (AT, 
BE. CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC NL, PT, SE). OAPI patent (BF, BJ, CF, CG, O, CM. 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: COMPOSITIONS AND METHODS FOR THE TREATMENT OF IMMUNE RELATED DISEASES 
(57) Abstract 

The present invention relates to a composition containing novel proteins and methods For the diagnosis and treatment of immune 
related diseases. 



BEST AVAILABLE COPY 

(0 2003 Copyright Derwent Information Ltd. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCI . 



AL 


A&anii 


BS 


Spun 


LS 


Uaotho 


SI 


Slovenia 


AM 


Armenia 


Fl 


Knhmd 


LT 




sx 


Slovakia 


AT 




FK 




LU 


Luxembourg 


SN 


Senega] 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


AxerbeijMi 


GB 


United Kingdom 


MC 




TD 


Chad 


EA 


Bosnia and Henegovrna 


GB 


Georgia 


MD 


Republic of Moldova 


TC 


Togo 


BB 


Barbados 


GH 


Ghana 


MC 




TJ 


Tapkhuo 


BB 


Belgian) 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turk me nman 


BP 


Burkina Faso 


GR 


Greece 




Repubbc of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 




IB 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


BiszD 


XL 




MR 




UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central Africm Republic 


JP 


Japan 


NE 


Nipr 


VN 


Viet Nam 


CC 


Congo 


KE 


Kenya 


NL 


ricuiefianos 


YU 


Yugoslavia 


CH 


Swkteriand 


KG 


Kytgyzitsn 


NO 


Norway 


ZYV 


Zimbabwe 


a 


Cote dTvoire 


KP 


Democratic People'i 


KZ 


New Zealand 






CM 






Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 




RO 


Romania 






CZ 


drrii Republic 


LC 


Saint Loch 


RU 


Russian Federation 






D£ 


Germany 


U 




SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Libera 


sc 


Singapore 







(0 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/US00/0S841 



COMPOSITIONS AND METHODS FOR THE TREATMENT OF IMMUNE RELATED DISEASES 



Field of the Invention 

5 The present invention relates to compositions and methods Cor the diagnosis and treatment of immanr 

related diseases. 

Background of the Invention 
Immune related and inflammatory diseases are the manifestation or consequence of fairly complex. 
10 often multiple interconnected biological pathways which in normal physiology are critical to respond to insult 
or injury, initiate repair from insult or injury, and mount innate and acquired defense against foreign organisms. 
Disease or pathology occurs when these normal physiological pathways cause additional insult or injury either 
as directly related to the intensity of the response, as a consequence of abnormal regulation or excessive 
stimulation, as a reaction to self, or as a combination of these. 
15 Though the genesis of these diseases often involves multistep pathways and often multiple different 

biological systems/pathways, intervention at critical points in one or more of these pathways can have an 
ameliorative or therapeutic effect. Therapeutic intervention can occur by either antagonism of a detrimental 
process/pathway or stimulation of a beneficial process/pathway. 

Many immune related diseases are known and have been extensively studied. Such diseases include 
20 immune-mediated inflammatory di seases, non-immune-mediated inflammatory diseases, infectious diseases, 
immunodeficiency riisrasrs. neoplasia, etc. 

T lymphocytes (T cells) are an important component of a immune response. T cells 

recognize antigens which are associated with a self-molecule e ncoded by genes within the major 
histocompatibility complex (MHC). The antigen may be displayed together with MHC molecules on the 
25 surface of antigen presenting cells, virus infected cells, cancer cells, grafts, etc. The T cell system eliminates 
these altered ceils which pose a health threat to the host mammal. T cells include helper T cells and cytotoxic 
T cells. Helper T cells proliferate extensively following recognition of an antigen -MHC complex on an 
. antigen presenting celL Helper T cells also secrete a variety of cytokines. Le.. lymphokines. which play a 
central role in the activation of B cells, cytotoxic T cells and a variety of other cells which participate in the 
30 immune response. 

A central event in both humoral and cell i™di*r»H immune re s po n ses is the activation and clonal 
expansion of helper T cells. Helper T cell activation is initiated by the imeracnon of the T cell receptor (TCR) 
- CD3 complex with an antigen-MHC on the surface of an antigen pr esenting cell This interaction mediates a 
cascade of biochemical events that induce the resting helper T cell to enter a cell cycle (the GO to Gl 

35 transition) and results in the expression of a high affinity receptor for IL-2 and sometimes IL-4. The activated 
T cell progresses through the cycle proliferating and differentiating into memory cells or e Sector cells. 

In addition to the signals mediated through the TCR, activation of T ceils involves additional 
costimulation in d uced by cytokines released by the antigen presenting cell or thro ugh interactions with 
membrane bound molecules on the antigen presenting cell and the T cell The cytokines IL-1 and EL-6 have 

40 been shown to provide a costhnnlatory signal Also, the interaction between die B7 molecule expressed on the 
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surface of an aniigcn presenting cell and CD28 and CTLA-4 molecules expressed on the T cell surface ettect 1 
cell activation. Activated T cells express an increased number of cellular adhesion molecules, such as ICAM- 
1 , integrins, VLA-4, LFA- 1 , CD56. etc. 

T-cell proliferation in a mixed lymphocyte culture or mixed lymphocyte reaction (MLR) is an 
5 established indication of the ability of a compound to stimulate the immune system. In many immune 
responses, inflammatory cells infiltrate the site of injury or infection. The migrating cells may be neutrophilic 
eosinophilic, monocytic or lymphocytic as can be determined by histologic examination of me affected tissues. 
Current Protocols in Immunology, ed John E. Coligan. 1994. John Wiley & Sons. Inc. 

Immune related diseases can be treated by suppressing the immune response. Using neutralizing 
10 antibodies that inhibit molecules having immune stimulatory activity would be beneficial in the treatment of 
immune-mediated and inflammatory diseases. Molecules which inhibit the immune response can be utilized 
(proteins directly or via the use of antibody agonists) to inhibit the immune response and thus ameliorate 
immune related disease. 

15 Summary of the Invennon 

The present invention concerns compositions and methods lor the diagnosis and treatment of immune 
related disease in mammals, including humans. The present invention is based on the identification of proteins 
(including agonist and antagonist antibodies; which either stimulate or inhibit the immune response in 
mammals. Immune related diseases can be treated by suppressing or enhancing the immune response. 

20 Molecules that enhance the immune response stimulate or potentiate the immune response to an antigen- 
Molecules which stimulate the immune response can be used therapeutically where enhancement of the 
immune response would be beneficial. Such stimulatory molecules can also be inhibited where suppression of 
the immune response would be of value. 

Neutralizing antibodies are examples of molecules that inhibit molecules having immune stimulatory 

25 acuvity and which would be beneficial in the treatment of immune related and inflammatory diseases. 
Molecuies which inhibit the immune response can also be utilized (proteins directly or via the use of antibody 
agonists) to inhibit the immune response and thus ameliorate immune related disease. 

Accordingly, the PRO polypeptides and anti-PRO antibodies and fragments thereof arc useful for the 
diagnosis and/or treatment (including prevention) of immune related di s eas e s. Antibodies which bind to 

30 stimulatory proteins are useful to suppress the immune system and the immune response. Antibodies which 
bind to inhibitory proteins are useful to <nmn|»™> the mmime system and the immune response. The PRO 
polypeptides and anti-PRO antibodies also useful to prepare medicines and medicaments for the treatment of 
immune related and inflammatory diseases. 

In one embodiment the invention provides for isolated nucleic acid molecules comprising nucleotide 

35 sequences that encodes a PRO polypeptide. 

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 8 1 % nucleic acid identity, alternatively at 
least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity, 
alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 

40 sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 
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87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, 
aitematively at least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid 
sequence identity, alternatively at least about 91% nucleic acid sequence identity, aitematively at least about 
92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity, 
alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid 
sequence ideality, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 
97% nucleic acid sequence identity, alternatively ai least about 98% nucleic acid sequence idenuty and 
alternatively at least about 99% nucleic acid sequence identity to (a) a DNA molecule encoding a PRO 
polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the 
signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or without die 
signal peptide, as disclosed herein or any other specifically defined fragment of the full-length amino acid 
sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence ia^ntity. 
15 alternatively at least about 82% nucleic acid sequence identity, aitematively at least about 83% nucieic acid 
sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 
85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, 
alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 
sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 
20 90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, 
alternatively at least about 92% nucleic acid sequence identiiy, alternatively at least about 93% nucleic acid 
sequence identity, alternatively at least about 94% nucleic acid sequence identity, aitematively at least about 
95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, 
alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid 
25 sequence identity, aliernanvely at least about 99% nucleic acid sequence identity to (a) a DNA molecule 
comprising the coding sequence of a full-length PRO polypeptide cDNA as disclosed herein, the coding 
sequence of a PRO polypeptide lacking the signal peptide as disclosed herein, the coding sequence of an 
extracellular domain of a transmembrane PRO polypeptide, with or without the signal peptide, as disclosed 
herein or the coding sequence of any other specifically defined fragment of the full-length amino acid sequence 
30 as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In a timber aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 
sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 8 1% nucleic acid 
sequence identity, aitematively at least about 82% nucieic acid sequence identity, aitematively at least about 
83% nucleic acid sequence identity, alternatively at least about 84% nucieic acid sequence identity, 
35 alternatively at least about 85% nucleic acid sequence identity, aitematively at least about 86% nucleic acid 
sequence identity, alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 
88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid sequence identity, 
alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid 
sequence identiiy, aitematively at least about 92% nucleic acid sequence identity, alternatively at least about 
93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid sequence identity, 



40 
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alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid 
sequence identity, alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 
98% nucleic acid sequence identity, alternatively at least about 99% nucleic acid sequence identity to (a) a 
DNA molecule that encodes the same mature polypeptide encoded by any of the human protein cDNAs 
5 deposited with the ATCC as disclosed herein- or (b) the complement of the DNA molecule of (a). 

In another aspect the invention provides for isolated nucleic acid molecule comprising a nucleoude 
sequence encoding a PRO polypeptide with is cither transmembrane domain-deleted or transmembrane 
domam- inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane 
domainfs) of such polypeptides are disclosed herein. Therefore, soluble extracellular domains of the herein 

1 0 described PRO polypeptides are contemplated. 

Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or die 
complement thereof, that may find use as. for example, hybridization probes, for encoding fragments of a PRO 
polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PRO polypeptide 
antibody or as antisense oligonucleotide probes. Such nucleic acid fragments are usually at least about 20 

1 5 nucleotides in length, alternatively at least about 30 nucleotides in length, alternatively at least about 40 
nucleotides in length, alternatively at least about 50 nucleotides in length, alternatively at least about 60 
nucleotides in length, alternatively at least about 70 nucleotides in length, alternatively at least about 80 
nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about 100 
nucleotides in length, alternatively at least about 110 nucleotides in length, alternauvely at least about 120 

20 nucleotides in length, alternatively at least about 130 nucleotides in length, alternatively at least about 140 
nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least about 160 
nucleoudes in length, alternatively at least about 170 nucleotides in length, alternatively at least about 180 
nucleotides in length, alternatively at least about 190 nucleotides in length, alternatively at least about 200 
nucleotides in length, alternatively at least about 250 nucleotides in length, alternatively at least about 300 

25 nucleotides in length, alternatively at least about 350 nucleotides in length, alternatively at least about 400 
nucleotides in length, alicrnatively at least about 450 nucleotides in length, altemanvely at least about 500 
nucleotides in length, alternauvely at least about 600 nucleoudes in length, alternatively at least about 700 
nucleotides in length, alternatively at least about 800 nucleotides in length, alternatively at least about 900 
nucleotides in length, alternatively at least about 1000 nucleoudes in length, alternatively at least about 1500 

30 nucleoude in length, alternauvely at least about 2000 nucleotides in length, alternatively at least about 2500 
nucleoude in length, altemanvely at least about 3000 nucleotide in length, alternatively at least about 4000 
nucleotide in length, alternatively at least about 5000 nucleotides in length, or more, wherein in this context the 
term "about" means the referenced nucleotide sequence length plus or minus 10% of that referenced length, it 
is noted that novel fragments of a nucleotide sequence encoding the respective PRO polypeptide may be 

35 determined in a routine manner by aligning the respective nucleotide encoding a PRO polypeptide with other 
known nucleoude sequences using any of a number of well known sequence alignment programs and 
determining which nucleoude sequence fragments) are novel. All such nucleoude sequences encoding the 
respective PRO polypeptides are contemplated herein. Also contemplated are the nucleotide molecules which 
encode fragments of the PRO polypeptides, preferably those polypeptide fragments that comprise a binding site 

40 for an anti-PRO polypeptide antibody. 
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In another embodiment the invention provides isolated PRO polypeptides encoded by any of the 
isolated nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated PRO porypeptide. comprising an amino acid 
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid 

5 sequence idenriry, alternatively at least about 82% amino acid sequence identity, alternatively at least about 
83% amino acid sequence identity, alternatively at least about 84% ammo acid sequence identity, alternatively 
ax least about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, 
alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid 
sequence identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about 

1 0 90% amino acid sequence identity, alternatively at least about 9 1% amino acid sequence identity, alternatively 
at least about 92% amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, 
alternatively at least about 94% amino acid sequence identity, alternatively at least about 95% amino acid 
sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 
97% amino acid sequence identity, alternatively at least about 98% amino acid sequence identity, alternatively 

15 ai least about 99% ammo acid sequence identity to a PRO polypeptide having a full-length amino acid 
sequence as disclosed herein, an amino acid sequence lacking the signal ^>eptide as disclosed herein, an 
extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any 
other specifically defined fragment of die full-length amino acid sequence as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 

20 seque nc e having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid 
sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 
83% amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively 
at least about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, 
alternatively at least about 87% ammo acid sequence identity, alternatively at least about 88% amino acid 

25 sequence identity, alternatively at least about 89% ammo acid sequence identity, alternatively at least about 
90% ammo acid sequence identity, alternatively at least about 91% amino acid sequence identity, alternatively 
at least about 92% amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, 
alternatively at least about 94% amino acid sequence identity, alternatively at least about 95% amino acid 
sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 

30 97% amino acid sequence identity, alternatively at least about 98% amino acid sequence identity, alternatively 
at least about 99% amino acid sequence identity to an amino acid seque n ce enr ori r ri by any of the human 
protein cDNAs deposited with the ATCC as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 
sequence scoring at least about 80% positives, alternatively at least about 81% positives, alternatively at least 

35 about 82% positives, alternatively at least about 83% positives, alternatively at least about 84% positives, 
alternatively at least about 85% positives, alternatively at least about 86% positives, alternatively at least about 
87% positives, alternatively at least about 88% positives, alternatively at least about 89% positives, 
alternatively at least about 90% positives, alternatively at least about 91% positives, alternatively at least about 
92% positives, alternatively at least about 93% positives, alternatively at least about 94% positives, 

40 alternatively at least about 95% positives, alternatively at least about 96% positives, alternatively at least about 
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97% positives, alternatively at least about 98% positives, alternatively at least about 99% positives when 
compared with the amino acid sequence of a PRO polypeptide having a full-length amino acid sequence as 
disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular 
domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other 
specifically defined fragment of the full-length amino acid sequence as disclosed herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal signal 
sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino 
acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein 
those processes comprise culruring a host cell comprising a vector which comprises the appropriate encoding 
nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the same 
from the cell culture. 

In another aspect, the invention provides an isolated PRO polypeptide which is either transmembrane, 
deleted or transmembrane domain-inactivated Processes for producing the same are also herein described, 
wherein those processes comprise culruring a host cell comprising a vector which comprises the appropriate 
encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and 
recovering the PRO polypeptide from the cell culture. 

in another embodiment, the invention provides vectors comprising DNA encoding any of the PRO 
polypeptides. Host cells comprising any such vector arc also provided. By way of example, the host cells may 
be CHO cells. £. coli or yeast. A process for producing any of the herein described polypeptides is further 
provided and comprises culruring host cells under conditions suitable for expression of the desired 
polypeptides and recovering the desired polypeptide from the cell culture. 

In other embodiments, the invention provides chimeric molecules comprising any of the herein 
described polypeptides fused to a heterologous polypeptide or amino acid sequence. Examples of such 
chimeric molecules comprise any of the herein described polypeptides fused to an epitope tag sequence or a Fc 
25 region of an immunoglobulin. 

In yet other embodiments, the invention provides oligonucleotide probes useful for isolating genomic 
and cDN A nucleotide sequences or as antisense probes, wherein those probes may be derived from any of the 
above or below described nucleotide sequences. 

In yet another embodiment, the invention concerns agonists and antagonists of the PRO polypeptides, 
that mimic or inhibit one or more functions or activities of the PRO polypeptides. In a particular embodiment, 
the agonist or antagonist is an antibody that binds to the PRO polypeptides or a small molecule. 

In another embodiment the invention provides an antibody which specifically binds to any of the 
above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized 
antibody, antibody fragment or single-chain antibody. In one aspect, the present invention concerns an isolated 
antibody which binds a PRO polypeptide. In another aspect, the antibody mimics the activity of a PRO 
polypeptide (an agonist antibody) or conversely the antibody inhibits or neutralizes the activity of a PRO 
polypeptide (an antagonist antibody). In another aspect, the antibody is a monoclonal antibody, which 
preferably has nonhuman complementarity determining region (CUR.) residues and human framework region 
(FR) residues. The antibody may be labeled and may be immobilized on a solid support. In a further aspect, 
the antibody is an antibody fragment, a monoclonal antibody, a single-chain antibody, or an anti-idiotypic 
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antibody. 

In a further embodiment the invention concerns a method of identifying agonists or antagonists to a 
PRO polypeptide which comprises contacting the PRO polypeptide with a candidate molecule and monitoring 
a biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native sequence 
PRO polypeptide. 

In another embodiment, the invention concerns a composition of matter containing PRO polypeptide 
or an agonist or antagonist antibody which binds the polypeptide in admixture with a carrier or excrpicnt In 
one aspect, the composition contains a therapeutically effective amount of the peptide or antibody. In annrfuf 
aspect, when the composition contains an immune stimulating molecule, the composition is useful for (a) 
increasing infiltration of inilammatory cells into & tissue of a mammal in need thereof, (b) stimulating or 
enhancing an immune response in a mammal in need thereof, or (c) increasing me proliferation of T- 
lymphocyies m a mammal in need thereof in response to an antigen In a further aspect, when the composition 
contains an immune inhibiting molecule, the composition is useful for (a) decreasing infiltration of 
inflammatory cells into a tissue of a mammal in need thereoC (b) inhibiting or reducing an immune response 
in a mammal in need thereof, or <c) decreasing the proliferation of T- lymphocytes in a mammal in need thereof 
m response to an antigen. In another aspect the composition contains a further active ingredient which may. 
for example, be a runner antibody or a cytotoxic or chemotherapeuiic agent Preferably, the composition is 
sterile. 

In another embodiment the invention concerns the use of the polypeptides and antibodies of the 
invention to prepare a composition or medicament which has the uses described above. 

In a mnher embodiment the invention concerns nucleic acid encoding an anti-PRO200. anri-PRO204, 
anri-PR0212. ami-PR02l6. anti-PR0226. anti-PRO240. anu-PR0235. anti-PR0245. anti-PROI72. anti- 
PR0273. ami-PR0272. anti-PR0332, ami-PR0526, anti-PR0701. ami-PR036l. anti-PR0362. anti-PR0363. 
anti-PR0364. anri-PR0356. anu-PR053l. ami«PR0533. anti-PROl083. anti-PR0865. anti-PRO770. ami- 
PR0769. anu-PR0788. anti-PROI 1 14. anu-PROI007. anti-PR0M84. anti-PROIOjl. anu'-PR01346, anti- 
PROU55. ann-PRO1250. anti-PRO!312. anti-PROU92. anu-PR01246. anu-PROI2S3. anti-PROI 195. anti- 
PR01343. ami-PRO!4l8. anti-PR01387. ami-PROI4IO. anti-PR01917. ami-PROI868. ami-PRO205. ami- 
PR021. anu-PR0269. ami-PR0344. anu-PR0333. ano-PR038L ami-PRO720, anu-PROR66, anti-PRO840, 
anu-PR0982. anti-PR0836. anti-PROI 159. ami-PR01358, anti-PROI 3 25. anti-PRO!338, anti-PR01434. 
ami-PR04333. anti-PRO4302. anti-PRO4430 or anu-PR05727 antibody, and vectors and recombinant host 
cells comprising such nucleic acid. In a still further embodiment the invention concerns a method for 
producing such an antibody by culturing a host cell transformed with nucleic acid encoding the antibody under 
conditions such that the antibody is expressed, and recovering the antibody from the cell culture. 

In a further embodiment the invention concerns an isolated nucleic acid molecule that hybridizes to 
the a nucleic acid molecule encoding a PRO polypeptide, or the complement thereof. The nucleic acid 
preferably is DNA. and hybridization preferably occurs under stringent conditions. Such nucleic acid 
mo lecules can act as ami sense molecules of the "»pKfffd genes identified herein, which, in turn, can find use 
in the modulation of the respective amplified genes, or as antisense primers m amplification reactions. 
Furthermore, such sequences can be used as part of ribozyme and/or triple helix sequence which, in turn, may 
be used in regulation of the amplified genes. 
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In another embodiment the invention concerns a method for determining the presence of a PRO 
polypeptide comprising exposing a cei! suspected of containing and/or expressing the polypeptide to an anti- 
PRO200, ami-PRO204, anti-PR02l2. anb-PR0216. anti-PR0226\ ami-PRO240, anri-PR0235, anti-PR0245, 
anri-PR0172, anti-PR0273. ana-PR027:>. ami-PR0332. ami*PR0526, anti-PRO701. anti-PR036U ami- 
PR0362, ami-PR0363. anri-PR0364. anti-PR0356. ami-PR0531, anti-PR0533, anti-PRO1083. anb-PR0865, 
ami-PRO770. anti-PR0769. anti-PR0788. anti-PRO 1 1 14. anri-PRO1007, anti-PROU84 ? anti-PRO1031. ami- 
PR01346, anti-PROl 155, anti-PRO 1 250. anti-PROi312. anti-PROI192. ami-PROl246, anti-PROl283, ami- 
PROH95, anti-PRO!343. ami-PR01418. anti-PROl387, ami-PROI410. anti-PR019l7. anti-PR01868. ann- 
PRO205. ami-PR021. anti-PR0269. anti-PR0344. ann-PR0333. anti-PR038l. ami-PRO720. ami-PR0866\ 
anti-PRO840, awi-PR0982. anti-PR0836. ano-PROl 159. anti-PR01358, anu-PR01325. ami-PROi338. and. 
PRO 1434, anti-PR04333. ami-PRO4302. ami-PRO4430 or atu>PR05727 antibody and deterniining binding 
of the antibody to the ceil. 

In yet another embodiment, the present invention concerns a method of diagnosing an immune related 
disease in a mammal, comprising detecting the level of expression of a gene encoding a PRO polypeptide fa) in 
a test sample of tissue cells obtained from the mammal, and <b) in a control sample of known normal tissue 
ceils of the same cell type wherein a higher or lower expression level in the test sample as compared to die 
control sample indicates the presence of immune related disease in the mammal from which the test tissue cells 
were obtained. 

In another embodiment, the present invention concerns a method of diagnosing an immune disease in 
a mammal, comprising (a) conuctmg on ann-PRO polypeptide antibody with a test sample of tissue cells 
obtained from the mammal and (b) detecting the formation of a complex between the antibody and the 
respective PRO polypeptide, respectively, in the test sample; wherein the formation of said complex is 
indicative of the presence or absence of said disease. The detection may be qualitative or quantitative, and may 
be performed in comparison with monitoring the complex formation in a control sample of known normal 
tissue cells of the same cell type. A larger quantity of complexes formed in the test sample indicates the 
presence or absence of an immune disease in the mammal from which the test tissue cells were obtained. The 
antibody preferably carries a detectable labeL Complex formation can be monitored, for example, by light 
microscopy, flow cytometry, fluorimcrry. or other techniques known in the art. The test sample is usually 
obtained from an individual suspected of having a deficiency or abnormality of the immune system. 

In another embodiment, the present invention concerns a diagnostic kit, containing an anri-PRO200. 
ami-PRO204, ami-PR0212, anti-PR0216. anu-PR0226. anti-PRO240, anti-PR0235. anti-PR0245, anb- 
PROI72. anti-PR0273, anb-PR0272. am>PR0332. ami-PR0526. ami-PRO70U anu-PR0361. ami-PR0362, 
ami-PR0363. am>PR0364. anti-PR0356. anti-PR0531. anti-PR0533. ami-PRO1083. anti-PR0865, anu- 
PRO770. anti-FR0769, anti-PR0788. anri-PROHI4 ; anti-PRO1007, anti-PROl 184, anri-PROI031, anh- 
PR01346. anti-PROl 155, anti-PRO1250, anu-PROI312. anri-PR01192, anti-PR01246. anti-PR01283, anti- 
PROl 195. anri-PR01343, anti-PR014!8, anti-PR01387. anti-PRO1410. anti-PR01917. ami-PR01868. anri- 
PR02Q5. ann-PR02I > anti-PR0269, anti.PR0344, ami-PR0333, anri-PR0381, anri-PRO720, anri-PR0866. 
am>PRO840, anti-PR0982. anti-PR0836. ami-PROi 159, anti-PRO 1358, anti-PR01325, anri-PR01338, anti- 
PR01434, ann-PR04333, ami-PRO4302, anb-PRO4430 or anti-PR05727 antibody and a carrier a 
buffer) in suitable packaging. The kit preferably contains instrucrjons for using the antibody to detect the PRO 
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polypeptide. 

Id a further embodiment, the invention concerns an article of manufacture, comprising: 
a container; 

an instruction on the container and 

a composition comprising an active agent contained within the container, wherein the composition is 
effective for stimulating or inhibiting an immune response in a mammaL the instruction on the container 
indicates that the composition can be used to treat an immune related disease, and the active agent in the 
composition is an agent stimulating or inhibiting the expression and/or activity of the PRO polypeptide. In a 
preferred aspect, the active agent is a PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235. 
PR0245. PR0172. PR0273, PR0272. PR0332. PR0526. PRO701, PR0361. PR0362. PR0363. PR0364. 
PR0356. PR0531, PR0533. PRO1083. PR0865. PRO770. PR0769. PR0788. PROl 114. PRO1007. 
PR01184. PRO1031. PR01346. PROl 155. PRO1250, PR01312. PROl 192. PR01246. PR01283. PROI 195. 
PR01343. PR01418. PR01387, PRO1410, PR01917. PR01868. PRO205. PR02K PR0269, PR0344. 
PR0333. PR0381. PRO720. PRO866. PRO840. PR0982. PR0836, PROl 159. PR01358. PR01325. 
15 PR01338. PROU34. PR04333. PRO4302. PRO4430 or PR05727 polypeptide or an am>PRO200. anti- 
PRO204. anti-PR0212. ami-PR0216. anu-PR0226. anu-PRO240. anti-PR0235. anii-PR0245. anti-PROI72. 
anti-PR0273. anti-PR0272. anti-PR0332. anti-PR0526. anti-PRO701. amt-PR036l. anu-PR0362. ami- 
PR0363. anti-PR0364. anti-PR0356> anti-PR0531. anti-PR0533, anti-PRO1083. anti-PR0865. anti-PRO770> 
anti-PR0769. anti-PR0788. ami-PROl ! 14. anti-PRO1007. anti-PR01184. anti-PROI031. anti-PR01346\ 
20 anti-PROI 155. anti-PRO1250. anti-PR01312, anti-PROI 192. ami- PRO 1 246. anti-PR01283. anti-PR01195. 
anti-PROI343. anti-PR0l4!8. anti-PROI387. anti-PRO1410. anti-PR0l9l7. anti-PROI868. anti-PRO205, 
am>PR02I. anti-PR0269. anu-PR0344. anii.PR0333. anti-PR0381. anti-PRO720. anti-PR0866. anti- 
PRO840. ami-PR0982. anti-PR0836, anti-PROl 159. anti-PRO 1 358. anti-PRO 1325. anti-PR01338. anti- 
PR01434, ami-PR04333. anti-PRO4302. anti-PRO4430 or ami-PR05727 antibody. 
25 A further embodiment is a method for identifying a compound capable of inhibiting the expression 

and/or activity of a PRO polypeptide by contacting a candidate compound with a PRO polypeptide under 
conditions and for a ume sufficient to allow these two components to interact. In a specific aspect, either the 
candidate compound or the PRO polypeptide is immobilized on a solid support In another aspect, the non- 
immobilized component carries a detectable label. 

Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an 
agonist or antagonist thereof as hereinbefore described, or an anti-PRO antibody, for die preparation of a 
medicament useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or 
antagonist thereof or an anti-PRO antibody, 

35 Brief Description of the Drawmes 

Figure 1 shows DNA29 10 1 -1 276 f SEQ ED NO:l). 

Figure 2 shows the native sequence PRO200 polypeptide UNQ174 (SEQ ID NO:2). 
Figure 3 shows DNA30871-1 157 (SEQ ID NO:I I). 

Figure 4 shows the native sequence partial length PRO204 polypeptide UNQ1 78 (SEQ ID NO: 12) . 
40 Figure 5 shows DNA30942-1 134 (SEQ ID NO:13). 
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Figure 6 shows the native sequence PR02I2 polypeptide UNO i 86 <SEQ ID NO: 14). 
Figure 7 shows DNA33087- 1 1 58 (SEQ ID NO: 1 8). 

Figure 8 shows the native sequence PR0216 polypeptide UNQ190 (SEQ ID NO: 1 9). 
Figure 9 shows DNA33460- 1 1 66 (SEQ ID NO:20). 

Figure 10 shows the native sequence PR0226 polypeptide UNQ200 (SEQ ID NO:21). 
Figure 1 1 shows DNA34387-I 138 (SEQ ID NO:25). 

Figure 12 shows the native sequence PRO240 polypeptide UNQ2I4 (SEQ ID NO:26). 
Figure 13 shows DNA35558-U67 (SEQ ID NO:30). 

Figure 14 shows the native sequence PR0235 polypeptide UNQ209 (SEQ ID NO:31). 
Figure 15 shows DNA35638-1 141 (SEQ IDNO:35). 

Figure 16 shows the native sequence PR0245 polypeptide UNQ219 (SEQ ID NO:36). 
Figure 17 shows DNA35916-I 161 (SEO IDNO:40). 

Figure 18 shows the native sequence PR0172 polypeptide UNQ146 (SEQ ID NO:41). 
Figure 1 9 shows DNA39523-t 192 (SEQ ID NO:45). 

Figure 20 shows the native sequence PR0273 polypeptide UNO240 (SEQ ID NO:46). 
Figure 2 1 shows DNA40620-1 183 (SEQ ID NO:50). 

Figure 22 shows the native sequence PR0272 polypeptide UNQ239 (SEO ID NO:5I). 
Figure 23 shows DNA40982-1235 (SEQ ID NO:56). 

Figure 24 shows the native sequence PR0332 polypeptide UNQ293 (SEQ ID NO:57). 
Figure 25 shows DNA44I84-1319 (SEQ ID NO:6l). 

Figure 26 shows the native sequence PR0526 polypeptide UNQ330 (SEQ ID NO:62). 
Figure 27 shows DNA44205-I285 (SEQ ID NO:66). 

Figure 28 shows the native sequence PRO70I polypeptide UNQ365 (SEQ ID NO:67). 
Figure 29 shows DNA454 1 0-1250 (SEQ ID NO:7I). 

Figure 30 shows the native sequence PR0361 polypeptide UNQ316 (SEQ ID NO:72). 
Figure 31 shows DNA454I6-125I (SEQ ID NO: 79). 

Figure 32 shows the native sequence PR0362 polypeptide UNQ317 (SEQ ID NO:80). 
Figure 33 shows DNA454I9- 1252 (SEQ ID NO:86). 

Figure 34 shows the native sequence PR0363 polypeptide UNQ3I8 (SEQ ID NO:87). 
Figure 35 shows DNA47365-1206 (SEQ ID NO:91). 

Figure 36 shows the native sequence PR0364 polypeptide UNQ319 (SEQ ID NO:92). 
Figure 37 shows DNA47470- 1 130 (SEQ ID NO: 101 ). 

Figure 38 shows the native sequence PR0356 polypeptide UNQ313 (SEQ ID NO:102). 
Figure 39 shows DNA483 14-1320 (SEQ ID NO: 106). 

Figure 40 shows the native sequence PR053 1 polypeptide UNQ332 (SEQ ID NO: 1 07). 
Figure4I shows DNA49435- 1219 (SEQ ID NO: 111). 

Figure 42 shows the native sequence PROS33 polypeptide UNQ334 (SEQ ID NO: 1 12). 
Figure 43 shows DNA5092M458 (SEQ ID NO:116). 

Figure 44 shows the native sequence PRO1083 polypeptide UNQ540 (SEQ ID NO:l 17). 
Figure 45 shows DNA53974- 1 40 1 (SEQ ID NO: 123). 
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Figure 46 shows the native sequence PR0865 polypeptide UNQ434 (SEQ H> NO: 124). 
Figure 47 shows DNA54228-I366(SEQ ZD NO:133). 

Figure 48 shows the native sequence PRO770 polypeptide UNQ408 (SEQ ID NO: 134). 
Figure 49 shows DNA5423 1 - 1 366 (SEQ ED NO: 139). 

Figure 50 shows the native sequence PR0769 polypeptide UNQ407 (SEQ ID NO: 140). 
Figure 51 shows DNA5 6405- 1 357 (SEQ ID NO; 141). 

Figure 52 shows the native sequence PR0788 polypeptide UNQ430 (SEQ ID NO: 142). 
Figure 53 shows DNA57033-1403 (SEQ ID NO: 143). 

Figure 54 shows the native sequence PROl 1 14 polypeptide UNQ557 (SEQ ID NO: 144). 
Figure 55 shows DNA57690- 1374 (SEQ ID NO: 145). 

Figure 56 shows the native sequence PRO1007 polypeptide UNQ49 1 (SEQ ID NO: 1 46). 
Figure57 shows DNA59220-I514 (SEQ ID NO:147). 

Figure 58 shows the native sequence PROl 184 polypeptide UNQ598 (SEQ ID NO:148). 
Figure 59 shows DNA59294- 1 38 1 (SEO ID NO: 149). 

Figure 60 shows the native sequence PRO 1 03 1 polypeptide UNQ516 (SEQ ID NO: 150). 
Figure 61 shows DNA59776-I600 (SEQ ID NO: 151). 

Figure 62 shows the native sequence PR01346 polypeptide UNQ70I (SEQ ID NO:l52). 
Figure 63 shows DN A59849- 1 504 (SEQ ID NO: 156). 

Figure 64 shows the native sequence PROl 155 polypeptide UNQ585 (SEQ ID NO:157). 
Figure 65 shows DNA60775-1532 (SEQ ID NO: 158). 

Figure 66 shows the native sequence PRO 1250 polypeptide UNQ633 (SEQ ID NO: 159). 
Figure 67 shows DNA61873- 1 574 (SEQ ID NO: 160). 

Figure 68 shows the native sequence PRO 1312 polypeptide UNQ678 (SEQ ID NO: 1 61 ). 
Figure 69 shows DNA628I4-I521 (SEQ ID N0:I62). 

Figure 70 shows the native sequence PROl 192 polypeptide UNQ606 (SEQ ID NO: 163). 
Figure 71 shows DNA64885-1529 (SEQ ID NO: 167). 

Figure 72 shows the native sequence PRO 1246 polypeptide UNQ630 (SEQ ID NO: 168). 
Figure 73 shows DNA65404- 1 55 1 (SEQ ID NO: 1 69). 

Figure 74 shows the native sequence PRO 1 283 polypeptide UNQ653 (SEQ ID NO: 1 70). 
Figure 75 shows DNA65412-1523 (SEQ ID NO: 177). 

Figure 76 shows the native sequence PROl 195 polypeptide UNQ608 (SEQ ID NO: 178). 
Figure 77 shows DNA66675-1587 (SEQ ID NO: 1 79). 

Figure 78 shows the native sequence PRO 1343 polypeptide UNQ698 (SEQ ID NO: 180). 
Figure 79 shows DNA68864-1629 (SEQ ID NO: 184). 

Figure 80 shows the native sequence PROM 1 8 polypeptide UNQ732 (SEQ ID NO: 185). 
Figure 8 1 shows DNA68872-1 620 (SEQ ID NO: 1 86). 

Figure 82 shows the native sequence PR01387 polypeptide UNQ722 (SEQ ID NO:I87). 
Figure 83 shows DNA68874-1622 (SEQ ID NO: 188). 

Figure 84 shows the native sequence PRO 1410 polypeptide UNQ728 (SEQ ID NO:189). 
Figure 85 shows DNA76400-2528 (SEQ ID NO: 190). 
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Figure 86 shows the native sequence PRO 1 91 7 polypeptide UNQ900 (SEQ ID NO: 191). 
Figure 87 shows DNA77624-251S (SEQ ID NO:192). 

Figure 88 shows the native sequence PR01868 polypeptide UNQ859 (SEQ ID NO:l93). 
Figure 89 shows DNA30868- 1 156 (SEQ ID NO:228). 

Figure 90 shows the partial native sequence PRO205 polypeptide UNQI79 (SEQ ID NO:229). 
Figure 91 shows DNA36638-1056 (SEQ ID NO:230). 

Figure 92 shows the native sequence PR021 polypeptide UNQ2 1 (SEQ ID NO:23 i). 
Figure 93 shows DNA38260- 1 180 (SEQ ID N0032). 

Figure 94 shows the native sequence PR0269 polypeptide UNQ236 (SEQ ID NO:233). 
Figure 95 shows DNA40592-1242 (SEQ ID NO:240). 

Figure 96 shows the native sequence PR0344 polypeptide UNQ303 (SEQ ID NO:241 ). 
Figure 97 shows DNA4 1374-1312 (SEQ ID NO:248). 

Figure 98 shows the partial length native sequence PR0333 polypeptide UNQ294 (SEQ ID NO:249). 
Figure 99 shows DNA44 1 94- 1 3 1 7 f SEQ ID NO:250). 

Figure 100 shows the native sequence PR0381 polypeptide UNQ322 (SEQ ID NO:251 ). 
Figure 101 shows DNA535J7-1366 (SCO ID NO:255). 

Figure 102 shows the native sequence PRO720 polypeptide UNQ388 (SEQ ID NO:256). 
Figure 103 shows DNA53971-I359 (SEQ ID NO:257). 

Figure 104 shows the native sequence PRO866 polypeptide UNQ435 (SEQ ID NO:258). 
Figure 105 shows DNA53987-1438 (SEQ ID NO;266). 

Figure 106 shows the native sequence PRO840 polypeptide UNQ433 (SEQ ID NO:267). 
Figure 107 shows DNA57700-1408 (SEQ ID NO:268). 

Figure 108 shows the native sequence PR0982 polypeptide UNQ483 (SEQ ID NO:269). 
Figure 1 09 shows DN A59620- 1463 (SEQ ID NO:270). 

Figure 1 10 shows the native sequence PR0836 polypeptide UNQ545 (SEQ IDNO:271). 
Figure 1 1 1 shows DN A60627- 1 508 (SEO ID NO:272>. 

Figure 1 12 shows the native sequence PROI 159 polypeptide UNQ589 (SEQ ID NO:273). 
Figure 1 1 3 shows DN A64890- 1612 (SEQ ID NO:274). 

Figure 1 14 shows the native sequence PRO 1358 polypeptide UNQ707 (SEQ ID NO:275). 
Figure 1 15 shows DNA66659- 1 593 (SEQ ID NO:276). 

Figure 1 16 snows the native sequence PRO 1325 polypeptide UNQ685 (SEQ ID NO:277). 
Figure 1 1 7 shows DNA66667- 1 596 (SEQ ID NO:278). 

Figure 1 18 shows the native sequence PRO 1 338 polypeptide UNQ693 (SEQ ID NO:279). 
Figure 1 19 shows DNA68818-2536 (SEQ ID NO:280). 

Figure 120 shows the native sequence PROU34 polypeptide UNQ739 (SEQ ID NO:28 1). 
Figure 121 shows DNA842 10-2576 (SEQ ID NO:285). 

Figure 122 shows the native sequence PR04333 polypeptide UNQ1888 (SEQ ID NO:286). 
Figure 123 shows DNA9221S-2554 (SEQ ID NO:292). 

Figure 124 shows the native sequence PRO4302 polypeptide UNQ1866 (SEQ ID N0293). 
Figure 125 shows DNA96878-2626 (SEQ ID NO:294). 
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Figure 126 shows the native se qu en c e PRO4430 polypeptide UNQ! 947 fSEQ ID NO:295). 
Figure 127 shows DNA98853-I739 (SEQ ID NO:296). 

Figure 128 shows the native sequence PR05727 polypeptide UNQ2448 (SEQ ID NO:297). 
Detailed Deserimion of the Preferred Embodiments 

L Definitions 

The terms "PRO polypeptide^)" and "PRO" as used herein and when immediately followed by a 
numerical designation refer to various polypeptides, wherein the complete designation (U.. TRO/number" or 
more particularly. PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235. PR0245. PR0172, 
PR0273. PR0272, PR0332. PR0526. PRO701, PR0361. PR0362. PR0363. PR0364. PR0356. PR0531. 
PR0533. PROI083. PROS65. PRO770. PR0769. PR0788. PROHI4. PRO1007. PROI184. PRO103I. 
PROI346. PROl 155. PROI250. PR013I2. PRO) 192. PRO 1 246. PR01283. PROI 195. PR01343. PROI4I8, 
PROI387. PRO14I0. PR019I7. PR01868. PRO205. PR021. PR0269. PR0344. PR0333. PR038I. PRO720, 
PR0866. PRO840. PR0982. PROS36. PROI 159. PR01358. PR01325. PR01338. PROI434, PR04333. 
PRO4302. PRO4430 or PR05727) refers to pamcular polypeptide sequences as described herein. The terms 
"PRO/number polypeptide" and "PRO/number" wheretn the term "number- is provided as an actual numerical 
designation (e.g.. as described above) as used herem encompass native sequence polypeptides and polypeptide 
variants (which are further defined herein). The PRO polypeptides described herein may be isolated from a 
variety of sources, such as from human ussue types or from another source, or prepared by recombinant or 
20 synthetic methods. 

A "native sequence PRO polypeptide^)" comprises a polypeptide having the same amino acid 
sequence as the corresponding PRO polypeptide derived from nature. Such native sequence PRO/number 
polypeptides can be isolated from nature or can be produced by recombinant or synthetic means. The term 
"native sequence PRO polypeptide/*)" specifically encompasses natumlly-occurring truncated or seemed 
forms of the spectf.c PRO/number polypeptide (e.g.. an extracellular domain sequence), naniraily-occurring 
variant forms (e.g.. alternatively spliced formsi and naturally-occurring allelic variants of the polypeptide. In 
various embodiments of the invenuon. the native sequence PRO polypept.de* disclosed herem are mature or 
fell-length native sequence polypeptides comprising the full-length amino acids sequences shown in the 
accompanying figures. Stan and stop codons ate shown in bold font and underlined in the figures. However, 
while the PRO/number polypeptides disclosed in the accompanying figures are shown to begin with 
methionine residues designated herein as amino acid position 1 in the figures, it is conceivable and possible 
that other methionine residues located either upstream or downstream from the amino acid position 1 in the 
figures may be employed as the starting amino acid residue for the PRO polypeptides. 

The "PRO polypeptide**) extracellular domain'- or "ECD~ refers to a form of the said polypeptide 
which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD 
will have less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have lea than 
0.5% of such domains. It will be understood that any transmembrane domains identified for the PRO 
polypeptides of the present invention are identified pursuant to criteria routinely employed in the art for 
identifying that rype of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but 
most likely by no more than about 5 amino acids at either end of the domain as initially identified herein. 
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Optionally, therefore, an extracellular domain of a PRO polypeptide may contain from about 5 or fewer amino 
acids on either side of the transmembrane domain/ extracellular domain boundary as identified in the Examples 
or specification and such polypeptides, with or without the associated signal peptide, and nucleic acid encoding 
them, are contemplated by the present invention. 
5 The approximate location of the "signal peptides'* of the various PRO/number PRO polypeptides 

disclosed herein are shown in the present specification and/or the accompanying figures. It is noted, however, 
that the C-terminai boundary of a signal peptide may vary, but most likely by no more than about 5 amino 
acids on either side of the signal peptide C-terminal boundary as initially identified herein, wherein the C- 
tenninai boundary of the signal peptide may be identified pursuant to criteria routinely employed in the an for 
10 identifying that Type of amino acid sequence element (e.g., Nielsen et a/.. Prot. Eng. 10:1-6 (1997) and von 
Heinje ei aL NucL Acids. Res, ]4:4683-4690 (1986)). Moreover, it is also recognized that in some cases, 
cleavage of a signal sequence from a secreted polypeptide is not entirely uniform, resulting in more than one 
secreted species. These mature polypeptides, where the signal peptide is cleaved within no more than about 5 
amino acids on cither side of the C-terminal boundary of the signal peptide as identified herein, and the 
! 5 polynucleotides encoding them, arc contemplated by the present invention. 

A "PRO polypeptide variant", "PRO/number variant*' or "PRO variant" means an active PRO 
polypeptide as defined herein (e.g.. below) having at least about 80% amino acid sequence identity wuh a full- 
length native sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking ihe 
signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal 
20 peptide, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed 
herein. Such PRO polypeptide variants include, for instance, polypeptides wherein one or more amino acid 
residues are added, or deleted, at the N- or C-tenninus of the full-length native amino acid sequence. 
Oro^narily. a PRO polypeptide variant will have at least about 80% amino acid sequence identity, alternatively 
at least about 8 1 % amino acid sequence identity, alternatively at least about 82% amino acid sequence identity. 
25 alternatively at least about 83% ammo acid sequence identity, alternatively at least about 84% amino acid 
sequence identity, alternatively at least about K5% ammo acid sequence idenrity. alternatively at least about 
86% amino acid sequence identity, alternatively at least about 87% amino acid sequence identity, alternatively 
at least about 88% amino acid sequence identity, alternatively at least about 89% amino acid sequence identity, 
aJteniatively at least about 90% amino acid sequence identity, alternatively at least about 91% amino acid 
30 sequence identity, alternatively at least about 92% amino acid sequence identity, alternatively at least about 
93% amino acid sequence identity, alternatively at least about 94% amino acid sequence identity, alternatively 
at least about 95% amino acid sequence identity, alternatively at least about 96% amino acid sequence identity, 
alternatively at least about 97% amino acid sequence identity, alternatively at least about 98% amino acid 
sequence identity, alternatively at least about 99% amino acid sequence identity with a full-length native 
35 sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking the signal 
peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, 
as disclosed herein or any other specifically defined fragment of a mil-length PRO polypeptide sequence as 
disclosed herein. Ordinarily, PRO polypeptide variants are at least about 10 amino acids in length, 
alternatively at least about 20 amino acids in length, alternatively at least about 30 amino acids in length, 
40 alternatively at least about 40 amino acids in length, alterriatrvdy at least about 50 amino acids in length, 
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alternatively at least about 60 amino acids in length, alternatively at least about 70 amino acids in length, 
alternatively at least about 80 amino acids in length, alternatively at least about 90 amino acids in length, 
alternatively at least about 100 amino acids in length, alternatively at least about 150 ammo acids in length, 
alternatively at least about 200 amino acids in length, alternatively at least about 300 amino acids in length, 
5 alternatively at least about 400 amino acids on length, alternatively at least about 500 amino acids in length, 
alternatively at least about 600 amino acids in length, alternatively at least about 700 amino acids in length, 
alternatively at least about 800 amino acids in length, alternatively at least about 900 amino acids in length, 
alternatively at least about 1000 amino acids in length, alternatively at least bout 1200 amino acids in length, 
alternatively at least about 1400 amino acids in length, alternatively at least about 1500 amino acids in length 
10 or more. 

Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the 
arnino acid residues in the specific PRO/number polypeptide sequence, after aligning the sequences and 
introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any 
15 conservanve substitutions as pan of the .sequence identity. Alignment tor purposes of determining percent 
amino acid sequence identity can be achieved in various ways that are within the skill in the an. for instance, 
using publicly available computer software such as BLAST, BLAST-2. ALIGN or Megaiign (DNASTAR) 
software. Tliose skilled in the an can determine appropriate parameters for measuring alignment, including 
any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. 
For purposes herein, however. % amino acid sequence identity values are generated using the sequence 
comparison computer program ALIGN-2. wherein the complete source code for the AUGN-2 program is 
provided in Table I below. The AUGN-2 sequence comparison computer program was authored by 
Genentech. Inc. and the source code shown in Tables I below has been filed with user documentation in the 
U.S. Copyright Office, Washington D.C.. 20559. where it is registered under U.S. Copyright Registration No. 
25 TOU510087. The AUGN-2 program is publicly available through Genentech. Inc.. South San Francisco. 
California or may be compiled from the source code provided in Table 1 below. The AUGN-2 program 
should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence 
companion parameters are set by the AUGN-2 program and do not vary. 

In situauons where AUGN-2 is employed for amino acid sequence comparisons, the % amino acid 
30 sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid 
sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: 



20 



35 



100 times the fraction X/Y 



where X is me number of amino acid residues scored as identical matches by the sequence alignment program 
AUGN-2 in that program's alignment of A and B, and where Y is the total number of arnino acid residues in 
B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino 
acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence 
40 identity of B to A. As examples of % amino acid sequence identity calculations using this method. Tables 2 
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and 3 demonstrate how 10 calculate the % amino acid sequence identity of the amino acid sequence designated 
"Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO" represents the ammo 
acid sequence of a hypothetical PRO/number polypeptide of interest, "Comparison Protein" represents the 
amino acid sequence of a polypeptide against which the "PRO" polypeptide of interest is being compared, and 
5 M X T" and "Z" each represent different hypothetical amino acid residues. 

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are 
obtained as described in the immediately preceding paragraph using the AL1GN-2 computer program. 
However, % amino acid sequence identity values may also be obtained as described below by using the WU- 
BLAST-2 computer program (Altschul et aL Methods in Enzymoiogy 266:460-480 (1996)). Most of the WU- 
10 BLAST- 2 search parameters are set to the default values. Those not set to default values. the adjustable 
parameters, are set with the following values: overlap span * I . overlap fraction ** 0.125, word threshold (T) - 
1 1, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed; a % amino acid sequence identity 
value is determined by dividing (a) the number of matching identical amino acid residues between the amino ( 
acid sequence of the PRO polypeptide of interest having a sequence derived from the native sequence PRO 
15 polypeptide and the comparison amino acid sequence of imerest (i.e.. the sequence against which the PRO 
polypeptide is being compared - which may be a PRO polypeptide variant* as determined by WU-BLAST-2 by 
{b) the total number of amino acid residues of the PRO polypeptide of interest. For example, in the statement 
"a polypeptide comprising an amino acid sequence A which has or having at least 80% amino acid sequence 
identity to the amino acid sequence B'\ the amino acid sequence A is the comparison amino acid sequence of 
20 interest and the amino acid sequence B is the amino acid sequence of the PRO polypeptide of interest 

Percent amino acid sequence identity may also be determined using the sequence comparison program 
NCBI-BLAST2 (Altschul et aL Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence 
comparison program may be downloaded from "htm://www.ncbi.nlm.gov" or otherwise obtained from the 
National Institute of Health. Bcthesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of 
25 those search parameters are set to default values including, for example, unmask « yes. strand = all. expected 
occurrences - 10. minimum low complexity length = 15/5, multi-pass e-value * 0.01. constant for multi-pass * 
25. dropotT for final gapped alignment * 25 and scoring matrix * BLOSUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % amino 
acid sequence identity of a given amino acid sequence A to. with, or against a given amino acid sequence B 
30 (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino 
acid sequence identity to, with, or against a given amino acid seq uen ce B) is calculated as follows: 

100 times the fraction X/Y 

35 where X is the number of amino acid residues scored as identical matches by the sequence alignment program 
NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid 
residues in B. It win be appreciated mat where the length of ammo acid sequence A is not equal to the length 
of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid 
sequence identity of B to A. 
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"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule 
which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid 
sequence identity with a nucleotide sequence encoding: (I) a full-length native sequence PRO polypeptide as 
disclosed herein: (2) a full-length native sequence PRO polypeptide lacking the signal peptide as disclosed 
herein: (3) an extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed 
herein or (4) any other fragment of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, a 
PRO polypeptide variant polynucleotide will have at least about 80% nucleic acid sequence identity, 
alternatively at least about 81% nucleic acid sequence identity, alternatively ar least about 82% nucleic acid 
sequence identity, alternatively at least about 83% nucleic acid sequence identity, alternatively at least about 
84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence identity, 
alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid 
sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 
89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, 
alternatively a, leas, about 91% nucleic acid sequence identity, alternatively a, leas, about 92% nucleic acid 
sequence identity, ahcmaux-ely at least about 93% nucleic ac.d sequence identity, alternatively at least about 
94% nucleic ac,d sequence identity, alternatively a. least about 95% nucleic acid sequence identity, 
alternatively a, leas, about 96% nucleic acid sequence .denary, alternately a. leas, abou, 97% nucleic a«d 
sequence identity, alternatively at least about 98% nucleic ac.d sequence identity, alternatively at leas, about 
99% nucleic acid sequence identity with (1) a nucleic acid sequence encoding a full-length native sequence 
PRO polypeptide sequence as disclosed herein. (2) a full-length native sequence PRO polypeptide sequence 
lacking the signal peptide as disclosed herein, (3> an extracellular domain of a PRO polypeptide sequence, with 
or without the signal sequence, as disclosed herein or (4) any other fragment of a full-length PRO polypeptide 
on sequence as disclosed herein. Variants do not encompass the native nucleotide sequence. 

Ordinarily. PRO polypeptide variant polynucleotides are at least about 30 nucleotides in length, 
alternately a, least about 60 nucleotides in length, altemauvely at least about 90 nucleotides in length, 
alternatively a, leas, about 120 nucleotides .„ length, altemauvely at leas, about .50 nucleotides in length, 
altemauvely at leas, about 180 nucleot.des in length, alternatively a, least about 210 nucleotides in leneth. 
alternatively « leas, about 240 nucleotides in length, actively a. leas, about 270 nucleotides in length, 
alternatively at least about 300 nucleotides in length, alternatively at leas, about 450 nucleotides in length, 
alternatively a, least about 500 nucleotides in length, altemanvely a, leas, about 600 nucleotides in length, 
alternatively at least about 700 nucleotide* in length, alternatively a, least about 800 nucleotides in length, 
altemauvely at least about 900 nucleotides in length, alternatively ., least about 1000 nucleotides in length, 
alternatively a, least about 1200 nucleotides in length, alternatively at least about 1400 nucleotides in length, 
alternatively a, least about 1600 nucleotides in length, alternatively at least about 1800 nucleotides in length, 
alternatively at least about 2000 nucleotides in length, alternatively at leas, about 2500 nucleotides in leng*, 
alternatively at las, about 3000 nucleotides m length, alternatively at leas, about 3500 nucleotides in length, 
alternatively at least .bow 4000 nucleotides, alternatively at least about 5000 nucleotides or more. 

•Percent (%) nucleic acid sequence identity- with respect to PROencoding nucleic acid sequences 
tdennfied herein is defined as the percentage of mutoo^ * % c.*^ a* Umdetl ito ** 

nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
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necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent 
nucleic acid sequence identity can be achieved in various ways chat are within the skill in the art, for instance, 
using publicly available computer software such as BLAST. BLAST-2. ALIGN or Megaiign (DNASTAR) 
software. For purposes herein, however, % nucleic acid sequence identity values are generated using the 
5 sequence comparison computer program ALIGN-2. wherein the complete source code for me AL1GN-2 
program is provided in Table 1 below. The ALIGN- 2 sequence comparison computer program was authored 
by Genentech. Inc. and the source code shown in Table I below has been filed with user documentation in the 
VS. Copyright Office. Washington D.C.. 20559. where it is registered under U.S. Copyright Registration No. 
TXU5I0087. The ALIGN-2 program is publicly available through Genentech. Inc.. South San Francisco, 

10 California or may be compiled from the source code provided in Table 1 below. The ALIGN-2 program 
should be compiled for use on a UNTX operating system, preferably digital UNIX V4.0D. All sequence 
comparison parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to. with, or against a given nucleic acid sequence D 

15 i which can alternatively be phrased as a given nucleic acjd sequence C that has or comprises a certain % 
nucleic actd sequence identity to. with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

20 where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN- 
2 in that program's alignment of C and D. and where Z is the total number of nucleotides in D. It will be 
appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence 
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to C. 
As examples of % nucleic acid sequence identity calculations. Tables 4 and 5. demonstrate how to calculate 

25 the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the 
nucleic acid sequence designated "PRO-DNA". wherein TRO-DNA w represents a hypothetical PRO 
polypeptide - encoding nucleic acid sequence of interest. "Comparison DNA" represents the nucleotide 
sequence of a nucleic acid molecule against which the "PRO-DNA" nucleic acid molecule of interest is being 
compared, and H N\ "L w and M V W each represent different hypothetical nucleotides. 

30 Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are 

obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. 
However. % nucleic acid sequence identity values may also be obtained as described below by using the WU- 
BLAST-2 computer program ( Altschul et ai. Methods in Enzymoio^ 266:460-480 (1996)). Most of the WU- 
BLAST-2 search parameters are set to the default values. Those not set to default values, i.e.. the adjustable 

35 parameters, are set with the following values: overlap span - I, overlap fraction - 0.125. word threshold (T) - 
1 1. and scoring matrix - BLOSUM62. When WU-BLAST-2 is employed, a % nucleic acid sequence identity 
value is determined by dividing (a) the number of matching identical nucleotides between the nucleic acid 
sequence of me PRO polypeptide - encoding nucleic acid molecule of interest having a sequence derived from 
the native sequence PRO polypeptide - encoding nucleic acid (*.*.. the reference sequence) and the comparison 
40 nucleic acid molecule of interest {i.e., the sequence against which the PRO polypeptide - encoding nucleic acid 
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molecule of interest is being compared - which may be a PRO variant polynucleotide) as determined by WU- 
BLAST-2 by (b) the total number of nucleotides of the PRO reference sequence. For example, in the statement 
"an isolated nucleic acid molecule comprising a nucleic acid sequence A which has or having at least 80% 
nucleic acid sequence identity to the nucleic acid sequence B". the nucleic acid sequence A is the comparison 
nucleic acid molecule of interest and the nucleic acid sequence B is the nucleic acid sequence of the PRO 
polypeptide - encoding nucleic acid molecule of interest 

Percent nucleic acid sequence identity may also be determined using the sequence comparison 
program NCBI-BLAST2 (Altschul a ul.. Nucleic Acid, Res. 25:3389-3402 (1997)). The NCBI-BLAST2 
sequence comparison program may be downloaded from -h«p^www.„cb,.nlm.nih.goV or otherwise obtained 
from the National Institute of Heath. Bethesda. MD. NCBI-BLAST2 uses several search parameters, wherein 
all of those search parameters are set to default values including, for example, unmask = yes. strand = all. 
expected occurrences - 10. minimum low complexity length = 15/5. multi-pass e-value - 0.01. constant for 
multi-pass » 25. dropoff for final gapped alignment - 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to. with, or agamst a eiven nucleic acid sequence D 
(which can alternately be phrased as a given nucleic acid sequence C that has or comprises a certain % 
nucleic acid sequence identity to. with, or against a gh« nucleic acid sequence D) is calculated as follows: 

1 00 times the fraction VWZ 

where W is the number of nucleotides scored as identical matches by the sequence alignment program NCBI- 
BLAST2 in that program s alignment of C and D. and where Z is the total number of nucleotides in D. It will 
be appreciated that where the length of nucleic acid sequence C is no, equal to the length of nucleic acid 
sequence D. the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity 
ofDtoC. 

In outer embodiments. PRO vanant polynucleotides are nucleic ac.d molecules that encode an active 
PRO polypep„de and which arc capable of hybridiztng, preferably under sningen. hvbridization and wash 
commons, to nucleotide sequences encoding a full-length PRO polypeptides as disclosed herein. PRO variant 
polypeptides may be those that are encoded by a PRO variant polynucleotide. 

The term "positives', in the context of sequence comparison performed as described above, includes 
residues in the sequences compared that are no, identical but have sunilar properties (e.g.. as a result of 
conservative substitutions, see Table 6 below). For purposes herein, the % value of positives is determined by 
mviding (a, the number of ammo add residues scoring a positive value between the PRO polypeptide sequence 
of tmerest havmg a sequence derived from a native sequence PRO polypeptide and the comparison amino acid 
sequence of mterest (Le.. the amino acid sequence against which the PRO polypeptide sequence is being 
compared) as determined in the BLOSUM62 matrix of WU-BLAST-2 by (b) toe total number of amino acid 
residues of the PRO polypeptide of interest 

Unless specifically stated otherwise, the % value of positives is calculated as described in the 
immediately preceding paragraph. However. m the context of the amino acid sequence idenmy comparisons 
performed as described for AUGN-2 and NCBI-BLAST-2 above, mcludes amino acid residues in me 
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sequences compared that are not only identical, but also those that have similar properties. Amino acid 
residues mat score a positive value to an amino acid residue of interest are those that are either identical to the 
amino acid residue of interest or are a preferred substitution (as defined in Table 1 below) of the ammo acid 
residue of interest 

5 For amino acid sequence comparisons using ALIGN-2 or NCBI-BLAST2. the % value of positives of 

a given amino acid sequence A to. with, or against a given amino acid sequence B (which can alternatively be 
phrased as a given amino acid sequence A that has or comprises a certain % positives to. with, or against a 
given amino acid sequence B) is calculated as follows: 

10 100 times the fraction X/Y 

where X is the number of amino acid residues scoring a positive value as defined above by the sequence 
alignment program ALIGN-2 or NCBI-BLAST2 in that program's alignment of A and B. and where Y is the 
total number of amino acid residues in B. It will be appreciated that where the length of ammo acid sequence 
15 A is not equal to the length of amino acid sequence B. the positives of A to B will not equal the % positives 
of B to A. 

"Isolated." when used to describe the various polypeptides disclosed herein, means polypeptide that 
has been identified and separated and/or recovered from a component of its natural environment. Contaminant 
components of its natural environment are materials that would typically interfere with diagnostic or 

20 therapeutic uses for the polypeptide, and may include enzymes, hormones, and other protemaceous or non- 
proteinaceous solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to 
obtain at least 15 residues of N-termina) or internal amino acid sequence by use of a spinning cup sequenator. 
or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or. 
preferably, silver stain. Isolated polypeptide includes polypeptide in sim within recombinant cells, since at 

25 least one component of the PRO polypeptide in its natural environment will not be present. Ordinarily, 
however, isolated polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide - encoding nucleic acid or other polypeptide-encoding nucleic acid is 
a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule 
with which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An 

30 isolated polypeptide-encoding nucleic acid molecule is other than in the context or setting in which it is found 
in nature. Isolated polypeptide - encoding nucleic acids therefore are distinguished from the polypeptide * 
encoding nucleic acid molecule existing in natural celh. However, an isolated PRO polypeptide - encoding 
nucleic acid molecule includes the same contained in cells that ordinarily express the specific polypeptide 
where, for example, the nucleic acid molecule is in a chromosomal location different from mat of natural cells. 

35 The term "control sequences" refers to DNA sequences necessary for the expression of an operably 

linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, 
for example, in c l u d e a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic 
cells are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 

40 acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
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polypepode ,f « is expressed as a preprotein that panic.pa.es in the secretion of the polypeptide: * promoter or 
enhancer ,s operably Unked to a coding sequence if i, affects the tnmscription of the sequence: or a ribosome 
bmdmg site is operably linked to a coding sequence if it is positioned so as to facilitate transition. Generally 
"operably linked' means that the DNA sequences being linked are contiguous, and. in the case of a secretory' 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Lu*^ is 
accomplished by ligation a, convenient restriction sites. If sueh sites do not exist the synthetic oligonucleotide 
adaptors or linkers are used in accordance with conventional practice. 

•Stringency- of hybridization reactions is readily deierminable by one of ordinary skill in the an. and 
generally is an empirical calculation dependent upon probe length, washing temperature, and salt 
concentrat.cn. In genera., longer probes require higher temperarures for proper ammaling, wh,le shorter probes 
need lower temperatures. Hybridization generally depends on the ability of denatured DNA to reatmeal when 
complementary strands are present in an envimnmem below their melting temperature. The higher the degree 
of des.red homology between the probe and bybridizable sequence, me higher the relative temperature which 
can be used. As a result i, follows mat higher relative temperatures would tend to make .he reaction 
15 condmons more stringent wh.le lower temperarures less so. For additional details and explanation of 
stnngency of hybridization reactions, see Ausube. e, „/.. Current Protocol, in Molecular Biology. Wiley 
Intersccnce Publishers. ( 1 995). 

"Stringem conditions" or "high stnngency conditions." as defined herem. mav be identified by those 
that (I, emptey tew ionic suength and high temperature for washing, for example 0.015 M sodium 
20 ch.oride0.0015 M sodium citram/0.1% sodium dodecy. su.fate at 50-C: (2) employ during hybridization a 
denamnng agent such as fotmamide. for examp.e. 50-/. ,v/v, .brmamide with 0.1% bovme serum 
a,bumin,0..% FicoU/O..-/. po.yviny.pyrro.idone/50 mM sodmm phospha.e buffer a, pH 6.5 with 750 mM 
sodium ch.oride. 75 mM sodium crate a, 42"C; or (3) employ 50-/. fonaamide. 5 x SSC (0.75 M NaCI. 0 075 
M sodium ci.au,. 50 mM sodium phosphate (p H 6.8). 0..% sodium pyrophosphate. 5 x Denhardfs so.ut.on. 
25 somcamdsalmonspennDNA.SO W m... 0..% SDS. and .0% dextran su.fate at42"C. with washesat^C in 
0.2 x SSC (sodium chloridcsodium crate, and 50% formam.de a, 55'C. fo.lowed by a h.gh-stringencv wash 
consisting of 0.1 x SSC coniaimnc £DTA at 55°C 

"Modenue.y sumgen. conditions" may be identified as described by Sambrook « ai. Molecular 
a^Al^toryManunl^^CM^^^ 1 989. and inch^ me use of washing 
30 »,utio„ and hybndization conditions { , g .. temperamre. ionic strength ami %SDS) .ess stringem that those 
desenbedabove. An example of moderately sumgem conmnorts is overnight incubation at 37TT in a solution 
compnsmg: 20% fotnumude. 5 x SSC (.50 «M NaCL 15 mM trisodium curate). 50 mM sodium phosphate 
0.H 7.6). 5 x Denhardfs solution. .0% dextran su.fate. and 20 M g/mL denamnm sheared salmon sperm DNA. 
followed by washing the film* « . x SSC a, about 37-50-C. The akiUeo anisan wfl. recogmze how to adjust 
the temperature, tonie sntngth, etc.. as necessary to accommodate facers such a, probe length and the like. 

Annbodms- (Abs) and -immunoglobulins" (Igs) are glycoproteins having the same general structural 
chamcKnsucs. While antibod.es exhibit binding specificity to a specific antigen, immunoglobulins include 
bo* antibodies and other antfcody-.ike molecules which lack antigen specificity. Polypeptides of the latter 
tend are. for example, produced at low ,eve b by .he .ymph system and a. mcreased levels by myelomas. The 
term "antibody" is used in the broadest sense and specifically covers, without limhation, intact monoclonal 



21 



SUBSTITUTE SHEET (RULE 26) 



(C) 2003 Copyright Derwenl Information Ltd. 



WO 00/53758 



PCT/US00/0584I 



antibodies (including agonist, antagonist and neutralizing antibodies), polyclonal antibodies, multispecific 
antibodies (e.g.. bispecific antibodies) formed from at least two intact antibodies, single chain antibodies 
binding the epitopes specific to the PRO polypeptide and antibody fragments so long as they exhibit the desired 
biological activity. An anti-PRO200. anti-PRO204. anti-PR0212. anri-PR0216. anu-PR0226, am>PRO240, 
ami-PR0235, anu-PR0245, anti-PROl72. anti-PR0273, anti-PR0272. ami-PR0332. and-PR0526, atm- 
PRO701. anu-PR0361, anti-PR0362. anti-PR0363. anti-PR0364. anri-PR0356. anti-PR053I. ann-PR0533, 
ami-PRO1083. ami-PR0865. anti-PRO770. ami-PR0769. anti-PR0788. ami-PROllU. anti-PRO1007. anti- 
PROl 184. anti-PRO103 1, anti-PROl346. anti-PROI 155. anri-PRO1250. anti-PROI312, anti-PROl 192. anu- 
PR01246, anti-PR01283. anti-PROl 195. anti-PROl343, anti-PR014l8. ami-PR01387. anti-PROUlO. anti- 
PR0I917. ami*PR01868. anti-PRO205. anu-PR02L anti-PR0269. anti.PR0344. anti-PR0333. anti-PR0381, 
anti-PRO720, anti-PR0866. anti-PRO840. anti.PR0982. anti-PR0836. anti-PROl 159. anu-PR01358. ami- 
PR01325. anti-PRO!338, anti-PROI434. anti-PR04333. ami-PRO4302, anti-PRO4430 or anti-PR05727 
antibody is an antibody which immunologically binds to a PRO200. PRO204. PR0212, PR0216. PR0226. 
PRO240. PR0235. PR0245. PR0172, PR0273. PR0272. PR0332, PR0526. PRO70I, PR0361. PR0362. 
PR0363. PR0364. PR0356. PR053I. PR0533. PRO 1083. PR0865. PRO770. PR0769. PR0788. PROI 1 14. 
PRO1007. PR01184. PROIOJI, PR01346. PROJ155. PROI250. PR01312. PROI 192. PR01246. PROI283. 
PROI 195. PROI343. PR0141S. PR01387. PR0141Q. PROI9I7. PR01868. PRO205. PR021. PR0269. 
PR0344. PR0333. PR038 1 . PRO720. PROS66. PR0840. PR0982. PR0836. PROI 159. PR01358. PR01325. 
PR01338. PROI434. PR04333. PRO4302. PRO4430 or PR05727, respectively, polypeptide. The antibody 
may bind to any domain of the PRO polypeptide which may be contacted by the antibody. For example, the 
antibody may bind to any extracellular domain of the polypeptide and when the entire polypeptide is secreted, 
to any domain on the polypeptide which is available to the antibody for binding. 

"Native antibodies" and "native irnmunoglobulins" arc usually heterotetrameric glycoproteins of about 
150.000 dalions. composed of two identical light (L) chains and two identical heavy (H) chains. Each light 
chain is linked to a heavy chain by one covalent disulfide bond, while the number of disulfide linkages varies 
among the heavy chains of different immunoglobulin isorypes. Each heavy and light chain also has regularly 
spaced intrachain disulfide bridges. Each heavy chain has at one end a variable domain ( V„) followed by a 
number of constant domains. Each light chain has a variable domain at one end (VJ and a constant domain at 
its other end: the constant domain of the light chain is aligned with the first constant domain of the heavy 
chain, and the light-chain variable domain is aligned with the variable domain of the heavy chain. Particular 
amino acid residues are believed to form an interface between the light- and heavy-chain variable domains. 

The term "variable" refers to the fact that certain portions of the variable domains differ extensively in 
sequence among antibodies and are used in the binding and specificity of each particular antibody for its 
particular antigen. However, the variability is not evenly distributed throughout the variable domains of 
antibodies. It is concentrated in three or four segments called "complementarity-determining regions" fCDRs) 
or "hypervariable regions" in both in the light-chain and the heavy-chain variable domains. The more highly 
conserved portions of variable domains are called the framework (FR). The variable domains of native heavy 
and tight chains each comprise four or five FR regions, largely adopting a p-sheet configuration, connected by 
the CDRs, which form loops connecting, and in some cases forming part o£ the 0-sheet structure. The CDRs 
in each chain are held together in close proximity by the FR regions and. with the CDRs from the other chain. 
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comributc to die formation of the antigen-binding site of antibodies (see Rabat et aL NLH Publ. No.91-3242, 
Vol 1, pages 647-669 (1991)). There are at least rwo techniques for determining the extent of the CDRs: (1) 
An approach based on the extent of cross-species sequence variability U. c. Kabat et a!.. Sequences of Proteins 
of Immunological Interest (National Institute of Health, Bcthcsda. MD); and (2) an approach based on 
crystaJiographic studies of antigen-antibody complexes (Chothia, C et a/., (1989), Nature 342: 877). 
Moreover. CDR's can also be defined using a hybrid approach incorporating the residues identified by both of 
the previous techniques. The constant domains are not involved directly in binding an antibody to an antigen, 
but exhibit various effector functions, such as pagination of the antibody in antux^-dependent cellular 
toxicity. 

•Antibody fragments" comprise a portion of an intact antibody, preferably die antigen binding or 
variable region of the intact antibody. Examples of ant, body fragments include Fab. Fab'. F(ab'h, and Fv 
fragments: diabodies: linear antibodies (Zapata a a i. Protein En S . 8 (10): 1057- 1 062 [\99S]); single^ham 
antibody molecules: and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen- binding fragments, called "Fab- 
fragments, each with a single antigen-binding site, and a resulual "Fc" fragment, whose name reflects its ability 
to crystallize readily. Pepsin treatment yields an Nab'), fragment that has rwo antigen-cotnbming sites and is 
still capable of cross-linking antigen. 

"FV is the minimum antibody fragment which contains a complete antigen-recognition and binding 
site. This region consists of a dimer of one heavy- and one light-cha.n variable domain in tight non-covalent 
association. It is in this configuration that the three CDRs of each variable domain interact to define an 
antigen-binding s.te on the surface of the V H -V L dimer. Collectively, the six CDRs confer antigen-binding 
specificity to the antibody. However, ex-en a single variable domain (or half of an Fv comprising only three 
CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than 
the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab' fragments differ from Fab fragments by .he addition of a few residues at the 
camoxy terminus of the heavy chain CHI domam including one or mom cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab' in which the cysteine residues) of the constant domains bear 
a free thiol group. Ffab'^ antibody fragments originally were produced as pairs of Fab' fragments which have 
hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The -light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to 
one of two clearly distinct types, called kappa «1 and lambda (X), based on the amino acid sequences of their 
constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There me five major classes of immunoglobulins: IgA, 
IgD. IgE, IgG. and IgM. and several of these may be further divided into subclasses (isotypes), e.g.. IgGl. 
IgG2, IgG3, IgG4. IgA. and IgAi The heavy-chain constant domains that correspond to die different classes 
of immunoglobulins are called a, 6, e. y. and u, respectively. The subunit structures and three-dimensional 
configurations of different classes of immunoglobulins are well known. 
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The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising (he population are i dentica l 
except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal 
antibodies are highly specific, being directed against a single antigenic site. Furthermore, in contrast to 
5 conventional (polyclonal) antibody preparations which typically include different antibodies directed against 
different determinants (epitopes), each monoclonal antibody is directed against a single determinant on the 
antigen. In addition to their specificity, the monoclonal antibodies are advantageous in that they are 
synthesized by the hybridoma culture, uncontaminated by other immunoglobulins. The modifier "monoclonal" 
indicates the character of the antibody as being obtained from a substantially homogeneous population of 
1 0 antibodies, and is not to be construed as requiring production of the antibody by any particular method. For 
example, the monoclonal antibodies to be used in accordance with the present invention may be made by the 
hybridoma method first described by Jtohler et aL Nature. 256: 495 [1975]. or may be made by recombinant 
DNA methods (see, e.g., U.S. Patent No. 4,816.567). The "monoclonal antibodies' 1 may also be isolated from 
phage antibody libraries using the techniques described in Clackson a aL Nature. 352:624-628 [1991] and 
15 Marks et aL J. MoL BioL 222:581-597 (1991 ). lor example. See also U.S Patent Nos. 5.750,373. 5.571.698. 
5.403.484 and 5.223.409 which describe the preparation of antibodies using phagemid and phage vectors. 

The monoclonal antibodies herein specifically include "chimeric" antibodies (immunoglobulins) in 
which a ponton of the heavy and/or light chain is identical with or homologous to corresponding sequences in 
antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the 
20 remainder of the chain(s) is identical with or homologous to corresponding sequences in antibodies derived 
from another species or belonging to another antibody class or subclass, as well as fragments of such 
antibodies, so long as they exhibit the desired biological activity (U.S. Patent No. 4,816.567: Morrison et aL 
Proc. Natl. Acad. ScL USA. 81:6851-6855 [1984]). 

"Humanized" forms of non-human {e.g., murine) antibodies are chimeric immunoglobulins. 
25 immunoglobulin chains or fragments thereof (such as Fv, Fab. Fab*. F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. For 
the most pan. humanized antibodies are human immunoglobulins (recipient antibody) in which residues from a 
complememarity-detennining region (CDR) of the recipient are replaced by residues fiom a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity, and 
30 capacity. In some instances. Fv framework region (FR) residues of the human immunoglobulin are replaced by 
corresponding non-human residues. Furthermore, humanized antibodies may comprise residues which are 
found neither in the recipient antibody nor in the imponed CDR or framework sequences. These modifications 
are made to further refine and maximize antibody performance. In general, the humanized antibody will 
comprise substantially alt of at least one. and typically two, variable domains, in which all or substantially all 
35 of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR 
regions are those of a human immunoglobulin sequence. The humanized antibody optimally also will 
comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin. For further details, sec Jones et aL Nature, 321 :522-525 (1986); Reichmann et aL Nature, 
332:323-329 [1988]; and Presta, Curr. Op. Struct BioL 2:593-596 (1992). The humanized antibody includes a 
40 ■priraatizc4 ,, antibody where the antigen-binding region of the antibody is derived from an antibody produced 
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by immunizing macaque monkeys with the antigen of interest. Antibodies containing residues from Old World 
monkeys are also possible within the invention. See, for example. U.S. Patent Nos. 5,658,570; 5.693,780; 
5,681.722; 5,750,105; and 5.756.096. 

Antibodies and fragments thereof in this invention also include 'affinity matured" antibodies in which 
an antibody is altered to change the amino acid sequence of one or more of the CDR regions and/or the 
framework regions to alter the affinity of the antibody or fragment thereof for the anngen to which it binds. 
Affinity maturation may result in an increase or in a decrease in the affinity of the matured antibody for the 
anngen relative to the starting antibody. Typically, the starting antibody will be a humanized human, chimeric 
or murine anubody and the affinity matured antibody will have a higher affinity than the starring antibody. 
During the maniration process, one or more of the amino acid residues in the CDRs or in the framework 
regions are changed to a different residue using any standard method. Suitable methods include point 
mutations using well known cassette mutagenesis methods (Wells et aL 1 985. Gene 34:3 15) or oligonucleotide 
mediated mutagenesis methods (Zoller et aL 1987, Nucleic Acids Res. 10:6487-6504). Affinity mamration 
may also be performed using known selection methods in which many mutations are produced and mutants 
15 having ihc desired affinity are selected from a pool or library of mutants based on improved affinity for the 
anugen or ligand. Known phage display techniques can be conveniently used in this approach. See. for 
example. U.S. 5.750 J73; US. 5.223.409. etc 

Human antibodies are also with in the scope of the antibodies of the invention. Human antibodies can 
be produced using various techniques known in the art including phage display libraries [Hoogenboom and 
20 Winter. J. \foL BioL. 227:381 H991); Marks et a/.. J. Moi. BioL 222:581 (1991)]. The tecniuques of Cole et 
al. and Boerncr et al. are also available for the preparation of human monoclonal antibodies (Cole et aL. 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss. p. 77 (1985); Boemer et aL J. Immunol., 147 
£T}:86-95 (1991); U. S. 5,750, 373]. Similarly, human antibodies can be made by introducing of human 
immunoglobulin loci into transgenic animals, e.g.. mice in which the endogenous immunoglobulin genes have 
25 been partially or completely inactivated. Upon challenge, human antibody production is observed which 
closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody 
repertoire. Tnis approach is described, for example, in U.S. Patent Nos. 5.545.807; 5.545.806; 5.569,825: 
5,625.126; 5.633.425; 5,661,016. and in the following scientific publications: Marks et aL Biotechnology ]0. 
779-783 (1992); Lonberg et aL Nature 368 856-859 (1994); Morrison, Nature 368, 812-13 (1994); Fishwild 
eta/.. Nature Biotechnology J4. 845-51 (1996); Neuberger, Nature Biotechnology 14. 826(1996): Lonberg and 
Huszar. Intern. Rev. ImmunoL 13 65-93 (1995). 

"Single-chain FV or "sFV antibody fragments comprise the V„ and V L domains of antibody, wherein 
these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide former comprises a 
polypeptide linker between the V H and V L domains which enables the sFv to form the desired structure for 
antigen binding. For a review of sFv see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol 113. 
Rosenbuig and Moore eds. t Springer- Verlag, New York, pp. 269-315 (1994). 

Tne term "diabodies" refers to small antibody fragments with two antigen-binding sites, which 
fragments comprise a heavy-chain variable domain (V H ) connected to a Ught-chain variable domain (VjJ in 
the same polypeptide chain (V H - V L ). By using a linker that is too short to allow pairing between the two 
domains onthcsamc chain * me domain$ are forced to pair with the complementary domains of another chain 
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and create two antigen-binding sites. Duibodies arc described more fully in. for example, EP 404.097: WO 
93/1 1161: and Hollinger et aL Proc Natl. Acad. ScL USA. 90:6444-6448 (1993). 

The word "label" when used herein refers to a detectable compound or composition which is 
conjugated directly or mdirectly to the compound e.g., antibody or polypeptide, so as to generate a "labelled'' 
compound. The label may be detectable by itself (e.g.. radioisotope labels or fluorescent labels) or. in the case 
of an enzymatic label may catalyze chemical alteration of a substrate compound or composition which is 
detectable. 

By "solid phase" is meant a non-aqucous matrix to which the compound of the present invention can 
adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g., 
controlled pore glass), polysaccharides (e.g.. agarose), polyacrylamides, polystyrene, polyvinyl alcohol and 
silicones. In certain embodiments, depending on the context the solid phase can comprise the well of an assay 
plate: in others it is a purification column (e.g.. an affinity chromatography column). This term also includes a 
discontinuous solid phase of discrete panicles, such as those described in U.S. Patent No. 4.275,149. 

The term "immune related disease" means a disease in which a component of the immune system of a 
mammal causes, mediates or otherwise contributes to a morbidity in die mammal. Also included are diseases 
in which stimulation or intervention of the immune response has an ameliorative effect on progression of the 
disease. Included within this term are immune-mediated inflammatory diseases, non-immune-mediated 
inflammatory diseases, infectious diseases, immunodeficiency diseases, neoplasia, etc. 

The term T cell mediated" disease means a disease in which T cells directly or indirectly mediate or 
otherwise contribute to a morbidity in a mnmmaL The T cell mediated disease may be associated with cell 
mediated effects, lympholcinc mediated effects, etc, and even effects associated with B cells if the B ceils are 
stimulated, for example, by the rymphokines secreted by T cells. 

Examples of immune-related and inflammatory diseases, some of which are immune or T cell 
mediated, which can be treated according to the invention include systemic lupus erythematosis. rheumamid 
anhritis. juvenile chronic arthritis, spondyloarthropathies, systemic sclerosis (scleroderma), idiopathic 
inflarnmatory myopathies i dermatomyositis. polymyositis). Sjogren's syndrome, systemic vasculitis, 
sarcoidosis, autoimmune hemolytic anemia lirnraune pancytopenia, paroxysmal nocturnal hemoglobinuria), 
autoimmune thrombocytopenia (idiopathic thrombocytopenic purpura, immune-mediated thrombocytopenia), 
thyroiditis (Grave's disease, Hashimoto's thyroiditis, juvenile lymphocytic myrotditis. atrophic thyroiditis), 
diabetes meliitus. immune-mediated renal disease (glomerulonephritis, mbulointersntial nephritis), 
demyeiiiiaang diseases of the central and peripheral nervous systems such as multiple sclerosis, idiopathic 
demyclinating polyneuropathy or GuUlain-Barre syndrome, and chronic inflammatory demyelinating 
rwiyneuropathy, hepatobiliary diseases such as infectious hepatitis (hepatitis A. B, C, D, E and other non- 
hepatotropic viruses), autoirnmune chronic active hepatitis, primary biliary cirrhosis, granulomatous hepatitis, 
and sclerosing cholangitis, mflatnrnatory bowel disease (ulcerative colitis: Crohn's disease), gluten-sensitive 
ertmropathy, and Whipple's disease, autoimmune or immune-mediated skin diseases including bullous skin 
diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as asthma, allergic 
rimntis, atopic demands, food hypersensitivity and urticaria, immunologic diseases of the lung such as 
eosinophilic pneumonias, idiopathic pulmonary fibrosis and byiwser^trvity pneumonitis, transplantation 
associated diseases mchuhng graft rejection and graft -versus-host-disease. Infectious diseases including viral 
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diseases such as AIDS (HIV infection), hepatitis A. B. C, D. and E. herpes, etc.. bactehaJ infections, fungal 
infections, protozoal infections and parasitic infections. 

Treatment" » 811 intervention performed with the in tendon of preventing the development or altering 
the pathology of a disorder. Accordingly, "treatmenT refers to both therapeutic treatment and prophylactic or 
preventative measures. Those in need of treatment include those already with the disorder as well as those m 
which the disorder is to be prevented. In treatment of an immune related disease, a therapeutic agent may 
direcdy decrease or increase the magnitude of response of a component of the immune response, or render the 
disease more susceptible to treatment by other therapeutic agents, e.g.. antibiotics, antifungals, anti- 
inflammatory agents, chemotherapeurics. etc. 

The term "effective amount" is at least the minimum concentration or amount of a PRO polypeptide 
and/or agonist/antagonist which causes, induces or results in either a detectable improvement m a component 
of the immune response in mammals as measured in an in \itro assay. For example, an increase or decrease in 
the proliferation of T-celis and/or vascular permeability as measured in Examples provided hercrn. 
Furthermore, a "AerapeuricaUy effective amount- is the minimum concentration or amount of a PRO 
15 polypeptide and/or agonist/antagonist which would be effective in at least ancnuaung a pathology (increasing 
or decreasing as the case may be) a component of the immune response in mammals, the results of which 
effects a treatment as defined in the previous paragraph. 

"Chronic- administration refers to administration of the agem(s) in a continuous mode as opposed to 
an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 
•Intermittent" administration is treatment that is not consecutively done without interruption, but rather is 
cyclic in nature. 

The -pathology" of an immune related disease includes all phenomena that compromise the well- 
being of the patient. This includes, without limitation, abnormal or uncontrollable cell growth, antibody 
production, auto-antibody production, complement production and activation, interference with the normal 
25 functioning of neighboring cells, release of cytokines or other secretory products at abnormal levels, 
suppression or aggravation of any inflammatory or immunological response, infiltration of inflammatory cells 
(neutrophilic, eosinophilic, monocyuc. lymphocytic* into tissue spaces, etc. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, 
domestic and farm animals, and zoo. sports, or pet animals, such as dogs, horses, cattle, pigs, apes, hams ters, 
30 ferrets, cats, etc Preferably, the mammal is human. 

Administration "in combination with" one or more further therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutical^ acceptable carriers, excipients. or stabilizers which 
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often 
35 the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically 
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
awortric acid; low molecular weight (less man about 10 residues) polypeptide: proteins, such as serum 
albwnin, gelatin, or manunoglobulins: hydrophilic polymers such as polyvmylpynolidone; amino acids such as 
grycroe, gkitamine, asparagine, argmine or lysine; monosaccharides, disaccharides, and other carbohydrates 
mcroding glucose, manaose. or dextrin* chelating agents such as EDTA; sugar alcohols such as mannitol or 
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sorbitol: salt-forming countcrions such as sodium; and/or nonionic surfactants such as TWEEN™, 
polyethylene glycol (PEG), and PLURON1CS™. 

The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function of 
cells and/or causes destruction of cells. The terra is intended to include radioactive isotopes {e.g., l m . I 125 . Y*° 
and Re 18 *), chemotherapeuric agents, and toxins such as cnzymatically active toxins of bacterial, fungal, plant 
or animal origin, or fragments thereof. 

A "chcmotherapcutic agent" is a chemical compound useful in the treatment of cancer. Examples of 
cbemotherapeutic agems include adriamycin. doxorubicin, cpirubicin. 5-fluorouracil, cytosine arabinossde 
CAra-C). cyclophosphamide, thiotepa, busulfan. cytoxin. taxoids. c.g.. paclitaxel (TaxoL Bristol-Myers 
Squibb Oncology, Princeton. NJ). and doxetaxel (Taxotere. Rhone-Poulenc Rorer. Antony. France), toxotere. 
methotrexate, cisplatin, melphalan. vinblastine, bleomycin, etoposide. ifosfaraide, mitomycin C mitoxantrone. 
vmcrisrine. vinorelbine. carbopiatin. teniposide. daunomycin. carminomycuu amxnoptcnn. dactmomycin, 
mitomycins, esperamicins (see U.S. Pat. No. 4.675,187), melphalan and other related nitrogen mustards. Also 
included in this definition are hormonal agents that act to regulate or inhibit hormone action on tumors such as 
tamoxifen and onapristone. 

A "growth inhibitory agent" when used herem refers to a compound or composition which inhibits 
growth of a cell, especially cancer cell ovcrexpressing any of the genes identified herein, either in vitro or in 
vivo. Thus, the growth inhibitory agent is one which significantly reduces the percentage of cells 
overexpressing such genes in S phase. Examples of growth inhibitory agents include agents that block cell 
cycle progression (at a place other than S phase), such as agents that induce Gl arrest and M-phasc arrest. 
Classical M-phasc blockers include the vmcas (vincristine and vinblastine), taxol. and topo II inhibitors such as 
doxorubicin, epirubxein, daunorubicin. etoposide. and bleomycin. Those agents that arrest Gl also spill over 
into S-phase arrest, for example, DNA alkylating agents such as tamoxifen, prednisone. dacarbazine, 
mecblorethanunc. cisplatin. methotrexate. 5-nuorouracil. and ara-C Further information can be found in 77ie 
Molecular Basis of Cancer. Mendelsohn and Israel, eds.. Chapter I. entitled "Cell cycle regulation, oncogens, 
and antineoplastic drugs" by Murakami ci ui (WB Saunders: Philadelphia. 1995). especially p. 13. 

The term "cytokine" is a generic term for proteins released by one cell population which act on 
another cell as intercellular mediators. Examples of such cytokines are lymphokines. monokines, and 
traditional polypeptide hormones. Included among the cytokines are growth hormone such as human growth 
hormone, N-methionyl human growth hormone, and bovine growth hormone: parathyroid hormone: thyroxine: 
insulin: proinsulin: relaxin: proreJaxiiu glycoprotein hormones such as follicle stimulating hormone (FSH). 
thyroid stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor, fibroblast growth 
factor prolactin; placental lactogen: tumor necrosis fcctor-a and -p; mullerian-inhibiting substance: mouse 
goiiadotropm-associated peptide: mhibin: activin; vascular endothelial growth factor tntegrin: thrombopoictin 
(TPO); nerve growth factors such as NGF-P; platelet-growth factor transforming growth actors (TGFs) such 
as TGF-a and TGF-p; insulin-like growth factor-1 and -II: erytruopoietin (EPO); osteoinductive factors; 
interferons such as interferon- a, -0, and -y; colony stimulating factors (CSFs) such as macrophage-CSF (M- 
CSF): grariulocyte-macrophage-CSF (GM-CSF); and granuiocyte-CSF (G-CSF); mterieukins (ILs) such as IL- 
l, IL-la, IL-2, IL-3. IL-4, IL-5, IL-6. IL-7. H.-8, IL-9, IL-il, IL.12; a tumor necrosis factor such as TNF-a or 
TNF-p; and other polypeptide factors including UF and kit ligand (KL). As used herein, the term cytokine 
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includes proteins from natural sources or from recombinant cell culture and biologically active equivalents of 
the native sequence cytokines. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope 
against which an antibody can be made, yet is short enough such that it does not interfere with activity of the 
polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does 
not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino 
acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 
amino acid residues). 

"Active* or "activity" in the context of variants of the PRO polypeptide refers to form(s) of proteins of 
the invention which retain the biologic and/or the ability to induce the production of an antibody against an 
antigenic epitope possessed by the PRO polypeptide. More specifically, "biological activity" refers to a 
biological function (either inhibitory or stimulatory) caused by a native sequence or natiu^Dy-occurring PRO 
polypeptide. Even more specifically, "biological activity in the context of an antibody or another molecule 
dial can be identified by the screening assays disclosed herem <c.y.. an organic or inorganic small molecule, 
peptide, cic.) can be the ability of such molecules to induce or inhibit infiltration of mfiainmatory cells into a 
tissue io stimulate or inhibit T-ceil proliferation or activation, to stimulate or inhibit cytokine release by cells 
or to increase or decrease vascular permeability. Another specific biological activity is the increased vascular 
perroeabiliry or the inhibition thereof. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or folly 
blocks, inhibits, or neutralizes a biological activity of a native sequence PRO polypeptide disclosed herein. In 
a similar manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics or 
amplifies a biological activity of a native sequence PRO polypeptide disclosed herein. Suitable agonist or 
antagonist molecules specifically include agonist or antagonist antibodies or antibody fragments, fragments or 
25 amino acid sequence variants of native PRO polypeptides, peptides, small organic molecules, etc. Methods for 
ioentifying agonists or antagonists of a PRO polypeptide may comprise contacung a PRO polypeptide with a 
candidate agonist or antagonist molecule and measuring a detectable change in one or more biological 
activities normally associated with the same. 

A "small molecule" is defined herein io have a molecular weight below about 600 daltons. and is 
30 generally an organic compound. 

A liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
which is usenii for delivery of a drug (optionally including a chemotherapeutic agent) to a mammal The 
components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement of 
biological membranes. 

35 As used herein, the term "inimunoadhesin" designates antibody-like molecules which combine the 

binding specificity of a heterologous protein (an "adhesm") with the effector functions of immunoglobulin 
constant domains. Structurally, the mimunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin parr of an t~™™<.rfh~m 
molecule typically is a contiguous amino acid sequence comprising at least the binding sice of a receptor or a 
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ligaud. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
miinm^oglobuiiiu^iich as IgC-l. IgG-2. lgC-3, or lgG-4 subtypes, IgA (including IgA-l and IgA-2). IgE, IgD 
orlgM 
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mt .dayt26|(261 « { 

/• ABCDEFCHIJKLM N0PQRSTUVWXY2V 
/• A •/ { 2. 0.-2. 0. 0.-4. 1,1,1, 0,1,2,1. 0. M. 1. 0.-2. 1. 1.0. 0.-6. 0 -3 0) 
/• B •/ { 0. 3.-4. 3. 2.-5. 0. 1.-2. 0. 0.-3.-2. 2.~M,I. I. 0. 0. 0. 0-2-5 0 -3 1} 
/■ C •/ {-2.-*;15,5,5.-4,3,3,2. 0.-5.-6.-5.-47 M.-3.-5.-4. 0.-2. 0,*2.-8 0 0 -5} 
/•DV {0.3.-5.4.3.-6. 1.1,2.0.0,4,3. 2. M,l. 2.-1. 0. 0. 0.-2 -7 0^ 2) 
/• E •/ { 0. 2. -5. 3. 4.-5. 0. 1.-2. 0. 0.-3.-2. I.~M,I. 2.-1 0 0 0 -2 -7 0 -4 31 
'* V 9 H-' 5 -- 4 " 6 " 5 - MA K 0.-5. 2. 0.-4;m..5.-5.^.-3.-3. 0.-1. 0. 0. 7,5}, 
9* C •/ {I. 0.-3. I. 0.-5. 5.-2.-3. 0.-2M.-3. 0. M,l,l.-3. I. 0. 0.-1 -7 0 -5 01 

/•H '/ {-1. 1,-3. 1. l^.-2.6.-2.0.0.-2..2.2."M.0.3.2.-l.-1.0.-2 , .3*o'o'2l 

/■ ! '/ {-1.-2.-2,2.-2. 1,3.-2. 5. 0,2. 2. 2,2." M, 2,2. -2,1. 0. 0. 4,5 0 -1-2} 

/• J •/ { 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. 0. M. 0.0.0.0 0.0 0 0 0 0 0) 

/• K •/ (-1. 0,5. 0. 0,5,2. 0,2. 0. 5,3. 0. |.~M..|. 1.3. 0. 0. 0 -2 -3 0 -4 0) 

/• L •/ {.2,3.-6.-4,3. 2.-4,2. 2. 0,3. 6. 4,3." M, 3,2,3,3,1. 0. 2,2 0 -1 A 

25 /• M •/ {-1,2.-5,3.-2, 0,3,2. 2. 0. 0. 4. 6,2.~M,2,1 0 -2 -1 0 2 -4*oY-i} 

/• N ■/ { 0. 2.-4. 2. I .-I. 0. 2,2. 0. 1,3,2. 2. M.-l. 1.0. I. 0. 0 -2**-4 V-2." 1) 

/•P*/ { 1,1,3,1,1,5,1. 0,2. 0,1.-3,2,1. M. 6. 0.0. 1.0.0,1.-6 0 -5* of 

/-QV {0. 1,5.2.2,5,1.3,2.0. 1,2,1. I. M.0.4. 1,1,1.0,2,5 0.^*3} 

30 /• R '/ {-2. 0.-4,1, 1. -4,3. 2,2. 0. 3,3. 0. 0. M. 0. 1. 6. 0 -1 0 -2. 0-4 01* 

/•S-/ ( 1.0.0.0.0,3. 1,1,1.0.0,3,2. I.'M. 1,1.0.2. 1*0-1 -2LoYoi 

/•T-/ { 1.0,2. 0.0,3. 0,1. 0.0. 0,1.-1. 0.*M. 0,1,1. I. 3. 0, 0,5. oV 01 

/*UV {0. 0. 0.0.0.0.0.0.0.0.0.0.0.0. "m.o.o.o.o. 0.0.0 0 0 0 01 

V •/ { 0,2,2,2,2,1,1.-2. 4. 0,2. 2. 2,2." M,l.-2,2,1. 0. 0. M O W! 

/•XV {0.0.0.0.0.0.0.0.0.0.0,0.0.0. M. 0.0.0.0 0 0 0 0 0 0 0) 

/■ Y V {-3,3. 0,-4.-4. 7,5. 0,1. 0.-4,1 ,2,27m,5.^,3,3, 0,2. 0. 0.10.-4}. 



40 



/• ZV {0. 1,5. 2. 3,5. 0. 2,2. 0. 0,2,1. I.Jm. O.Yo. O. 0. 0,2,-6. 0M. 4} 



45 



50 



Page 1 of day.h 
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Tabic I tcont*) 



^include <sidio.fa> 




^indode <ctype.h> 




fdefme MAXJMP 


16 


^define MAXCAP 


24 


^define JMPS 


1024 


^define MX 


4 


^define DMAT 


3 


^define DM15 


0 


^define DINSO 


8 


/^define DINSI 


1 


fttefine PINSO 


6 


f define P1NS1 


4 


struct jmp ( 





short 



/• max jumps in a diag */ 

/* doo't continue u> penalize gaps larger than this */ 

max jmps in an path V 
/* save it there's at least MX-t bases since last jmp •/ 

/■ value of matching bases m f 

/• penalty tor mismatched bases •/ 

/* penalty for a gap */ 

/* penalty per base •/ 

/• penalty for a gap */ 

/• penalty per residue */ 



short 



}: 

struct thac { 
int 
long 
snort 



n( MAXJMP): 
x(MAXJMP): 



jmp 



score; 
offset 
ymp; 
jp: 



/* size of jmp (neg for dely) •/ 

base no. of jmp in seq x */ 
t* limits seq to 2 " 16-1 */ 



/• score at last jmp */ 
/• oflset of prev block ■/ 
/* current ;mp index •/ 
/• list of jmps •/ 



path{ 
int 



^PCJ /" number of leading spaces */ 

n( JMPS]: /* size of jmp tgap; •/ 

x[JMPS].7» loc of jmp (last elem before gap) •/ 



char 

char 

char 

char 

int 

int 

int 

int 

int 

int 



struct 



diag 
path 



•ofile: 

•namex|2|: 

•prog: 

•*q*|2|: 

dmax; 

draaxO: 

dna: 



IenO. lenh 
ngapx. ngapy: 



■dx: 
PP(2); 



/* 
/• 
/• 
»• 

t* 
I* 
/♦ 
/• 
/• 
/' 
/' 
/' 
/< 
/' 
/« 



output file name •/ 
seq names: geiseqsO •/ 
prog name for err msgs */ 
seqs: getseqst j */ 
1 best diag: nwi > •/ 
1 final diag V 
' set if ana: main() •/ 
1 set if penaiizinc end gaps •/ 
1 total gaps in seqs */ 
seq lens •/ 
total size of gaps */ 
max score: nw() •/ 
binnap tor matching */ 
current offset in jmp file */ 
holds diagonals */ 
holds path for seqs */ 



char 



•cillccO. •malloco. •bdexO, "strcpyO; 
•getseqO. m gjaUacO; 
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Table 1 fcnm') 

/■ Herd] rm an- Wunsch ^'gm^m program 
• 

•usage: progs fiiel fUc2 

• where file I and filc2 are cwo dna or iwo protein sequences. 

• The sequences can be in upper- or lower-case an may coniain ambiguiry 
■ Any lines beginning with ':'.'>* or '<* are ignored 

• Ma* file length is 65535 (limited by unsigned short x in the jrap struct) 

• A sequence with 1/3 or more of its elements ACGTU is assumed to he DNA 

• Oinpui is in the file "align.au' 
• 

• The program may create a trop file in /imp to hold info about tracenack. 

• Original version developed under BSD 4.3 on a vax 8650 
•/ 

Andude 'nvt.h* 
finctude "day.h" 



static _Jbvair26J » { 

^ T. 14.2. 13.0.0.4 . 1 1 .0.0. 12.0.3. 15.0.0,0.5.6.8.8.7,9.0. 10,0 

static _phvalf261 ■ { 

I. 2lf I < < fP--'A' ))|(l < <f'NVA>). 4. 8. 16. 32 64 
128. 256. OxFFFFFFF. I< < 10. 1< < 1 1. I < < 12. I < < 13 1 < <]4 
K<t5. 1<<I6. I<<17. K<I8. K<I9. 1< <20. \<<V 1«22 
1< C23. 1< <24. I < <25j(l < <C-EVA'»|CI < «QVA'» 



mamlac. av) 

int ac: 
cbar *avfj; 

{ 

prog - av|0]: 
if(ac!~3){ 

fprWfstderr. "usage: Its filel file2\n\ prog): 

rprinrftstderr/where tile! and file2 are rwo dna or two protein sequences.Vn"): 
rprmtfTstdcrr.-The sequences can he in upper- or lower-caseVn"): 
rprinttTstdcrr.-Any lines beginning with V or * <* are ignoredui*)- 
fprintftstderr. "Output is in the Tile \'align.outr\n~V 
exiid): 

} 

namexfO] = avf l|; 

namexfll = »v[2|; 

seoxfO) « getseqtnamexJOI. <fclenO): 

seqxj 1 1 = getseqt namex| 1 1. Sdtol ): 

xbm o (dna)? _dbval : jbval: 



0: '* I to penalize endgaps •/ 

ofite o "align.oui"; /« fflB m/ 



/* fill in the matrix, go the possible jmps •/ 
nao^mpsO: /* get the actual jmps */ 

primO: /• prim sua. alignment •/ 

cieamnXO): /• unlink any tmp files V 
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Table I jcopfl 

/* do the alignment, return best score: mainO 

* dna: value* in Rich and Smith. PNAS. 80. 1382-1386. 1983 

* pro: PAM 250 values 

5 * When scores ire equal, we prefer mismatches to any gap. prefer 

* a new gap to extending an ongoing gap. and prefer a gap in seqx 

* to a gap in seq y. 
•/ 

nwO 
10 { 





char 


*px. *py: 


/■ seqs and ptrs */ 




int 


•ndely. -dely: 


/• keep track of dely */ 




int 


addx, delx: 


/•keep track of delx V 


15 


int 


•tmp: 


/• for swapping rowO. rowl */ 


int 


mis: 


/• score for each type */ 




int 


insO. insl: 


/' insertion penalties •/ 




register 


id: 


/• diagonal index */ 




register 
register 


■j: 

•colO, •coll; 


/•jmp index •/ 

/• score for curr. last row •/ 


20 


register 


xx. yy: 


/• index into seqs •/ 



dx -= (struct diac *)c_calloct "to tset diags". lenO-Men! + 1 . sizeoft struct diae)K 

ndely ~ tint •>g_caUocrto get ndely". lenl -H. sizeoftfntu: 
2.S dely * tint •)gjcalloci "to get dely". Icnl + l, sizeoffint)): 

colO » (int ")E_caUoc("to get colO". lenl + 1. saeofUni)): 
coll « (int *)c callocrto get coll", lenl + 1. siaeotfint)): 
insO - <dna>? DINS0 : PINS0: 
insl «(dna)?DlNSl : PINSl: 

30 

smax « -10000: 
if (endgapsj { 

for (col0[0) « dely|01 « -insO. yy - 1: yy < « lenl; yy+ +) { 
col0(yyj = delyjyyj = col0|yy-lj - insl; 
35 ndelylyyl - yy: 

} 

coiOfOJ - 0: /• Waterman Bull Math Biol 84 •/ 

) 

else 

40 for(yy = I; yy < « lenl: yy+ + l 

delylyy] - -insO: 

/• fill in match matrix 
7 

45 for(px - seqx(0], xx - 1: xx lenO: px + + . xx+ + ){ 

/* mmalize first entry in col 
V 

lf(endgaps){ 

IT(xx «« 1) 

50 collfOl - ddx - -(msO-Knsl); 



} 

55 ebe{ 



60 



coUfO] - delx « colOfOl - insl: 
xx: 



colirO] - 0; 
delx » -insO; 
nddx - 0: 
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Tabfc 1 fconi'i 

forfpy -seqxfll.yy = i: yy < = lenl: py++. yy ++) { 
mis ■ coK)fyy-l|; 
if(dna) 

mis + - 'xbmCpx- A'Jixbmi-py. A I)? DMAT : DMIS: 
mis *«.<layl-px-'A-!l*py-'A'|: 

/• update penalty for del in x seq: 

• favor new del over ongong del 

* ignore MAXGAP if weighting end gaps 

If Cendgaps 1 1 ndelyfyyj < MAXGAP) { 

if (colOlyyj - insO > ■ delytyyl) { 

<Wytyy| . coKHyy] . (insO+insl): 
ndefy(yyj = | : 

}ebe{ 

, n «Wy(yy| -« msi; . 

20 ndeJy(yy| + + : 



10 



15 



30 



35 



40 



45 



50 



55 



60 



}e*$e{ 



} 



, 5 ^lylyyl * cnlOfyyl . nnso-finsl): 

ndelylyy| - | : 



if (col0|yyf . iinsO+insI > > - delyfyyj) { 
}ebe 



> 



ndeiytyy|+i 



/• update penalty tor del in y seq: 
• favor new del over ongong del 

if (endgaps 1 1 ndeix < MAXGAP) { 

tf (co!l!yy-l| . insO >= deU) { 

<«x = coll(yy-||.(insO+insl); 
ndelx « 1: 

delx -«= insl; 
nddx++; 

} 

if fcoH|yy-|| -linsO+insli > = delx) { 
delx »coll[yy.lj.(i a5 0+insi); 
ndelx o l; 

} dse 

ndelx+ + : 

} 



/ pick the maximum score: we're favoring 
• nm over any del and delx over defy 
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Table 1 (contM 
id - xx - yy -Hen! - 1; 

M jnr 

if (mis > = dclx &Il mis > * dely[yyj) 

coll[yy) - mis: 
else if (dclx > = dely{yy)) { 

collfyyj = delx: 

ij « dxjidl.ijmp: 

if fdx[idi.jp.n|0| &Jl f!dna 1 1 (odetx > - MAXJMP 
&& xx > dx(id].jp.x(ij)+MX) 1 1 mis > dx[id|.scorc+DINSO)) { 
dx(id).ijmp++: 
if(++ij >~ MAXJMP) { 
wriiejmpsiid): 
ij « dxfidj.ijmp » 0: 
dxfidj.offsei » offset: 

offset + - sizeoftstruct jmpj + sizeotf offset): 

} } 

dxfidljp.nhjl « nddx: 
<k[id|.jp.xfij] m xx: 
dxjid|.sc«re - delx: 

} 

else{ 

coll[yy| « detylyyj: 
ij = dx(id|.ijmp: 
if (dx!id|.jp.n|0I <!dna I j (ndely(yy) > - MAXJMP 

&& xx > dx|id|jp.xjij) + MX) 1 1 mis > dx|idJ.score+DtNSO)) { 
dxfid|.ijmpt»+: 
if{++,j >« MAXJMP) { 
writejmpsiid): 
ij - dxf idj.ijmp = 0: 
dx(id|.offsct = offset: 

offset + = soeofil struct jmp> + sizeoff offset): 

} 

dx|id|.jp.n|ij| - -ndelyiyy): 
dxfid|.jp.xfij) « xx: 
dxjid|.score ■ deiyfyyj; 

if (xx lenO &c&. yy < ienl) { 
/' last col 
•/ 

if <endgaps> 

colllyyj-. insO+tnsl*(ienl-yy): 
if (coU[yy) > smax) { 

smxx « coll(yy); 

dmax - id: 

> 

} 

} 

tf(endgaps<fc&xx < lenO) 

coU[yy-l] -= msO+inslMcnO-xx): 
If (coll [yy-]} > smax) { 

smax * coll|yy-l); 

dmax m id: 

} 

tmp * colO: colO - coil; coll * nnp: 

} 

(▼old) freetfcfaar »)ndely): 
(wid)free((dMr*)dely): 
(void) free<(char *)coK»: 
(void) freettchar *)coll); 
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Table I (comM 

■ 

• primO - only routine visible outside this module 

5 

* static: 

■ getmm) - trace back best path, count matches: primf) 

' PT.*ti gpQ - prim alignment of described in array p(J : print) 

• dumpblockO - dump a block of lines with numbers, sun: pr aiignO 
10 * numso - put out a number line: dumpblockf) 

• putiineO - put out a line (name. [nura|. seq. (num|): dumpblockf) 

* stars(> * -put a line of stars: dumpblockf) 

* stripnamet) - strip any path and prefix from a seqname 

15 



30 



35 



50 



'define SPC 3 

#defin*P_UNE 256 /• iiiaxixmira output line V 



20 Idefme P'SPC 3 
,dayI261f261 

inl 
FILE 

25 



inl 


olen; 


FILE 


•fx: 


primfj 




{ 


print 




int 




if«fx 



/* space between name or nuro and seq "/ 



/• set output line length *' 
/• output file *' 



lx, ry, firstgap. lastgap; /• overlap V 



fopenfofile, *w*)) «« 0) { 
JprintftKderr/5[s: can t write %s\n*. prog, ofile): 
ciemapt l ): 

fprinrftfx. 'Kfottxquaia: Isflengih - 5<mn\ namex|0) ItnOV 
fprroff&^<seeood sequence: *s (length = *<D\a-. nanrxlH. lenl): 
olen 60; 
lx » tenO: 
ly = lent: 
40 firstgap « lastgap « 0: 

lf(dmax < lent - 1) { /• leading gap in x V 
pp|0|.spc - firstgap * lenl - Umax - 1; 
ly-«ppl0).spc: 

45 else if (dmax > lenl - 1) { /• leading gap in y •/ 

pp(IJ.spc » firstgap « dmax - (lenl - 1); 
lx— ppllj.spc; 

if(dmaxO < lenO- I) { /• trailing gap jot 1 / 

lastgap - lenO- dmaxO -I; 
^ lx — lastgap; 

eta* (dmaxO > lenO- 1) {/• trailing gap in y V 
lastgap = dmaxO - (tenO - 1); 
55 ry -« lastgap: 

getmatflx, ly, firstgap, lastgap); 
pr^alignO: 

} 

60 
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Table 1 (conn 

* trace bad* die best pack count matches 
•/ 

static 

getmatflx. ly. firstgap. last gap) 

* u » to I* •core" (minus end gaps) */ 

int fetgap. lastgap: /* lading wailing overlap V 



{ 



tot nm. iO, il. sizO. sizl; 

char outx(321; 

pa: 
nO. nl: 

register char »p0. *pl: 

/• get total matches, score 
»/ 

iO « il - sizO - sizl - 0: 
pO « seqxiOl + pp(lj.spc: 
pi • seqx|l) + pp<0|.spc: 
nO *= pplij.spc + 1; 
nl - pp|0|.spc + t: 

ran m 0: 

whaet *nU&&*pI i { 
if(sizO){ 

pl + + : 
nl+ + ; 
sizO-; 

) 

else if (sizl ){ 

P0+-K: 
nO+ + : 
sizl-: 



if (xbmf *p0- * A ' I&xbmj *p I - ' A' )) 

nm + + : 
If(n0+^- -=pp{0].xfi01) 

- PP(01.nfiO++J: 
if(nt + - ==pp(lU|il|) 

*kl -PPllJ.n|iI^*|: 

pO++: 
pl + + : 



} 

P pa homology: 

* if penalizing endgaps. base is the shorter seq 
♦die. knock off overhangs and take shorter core 

if (endgaps) 

Ix-(len0< »eni)?!enO:ienl; 

else 

U - (U < ly)? lx : ry; 
pet »J OO.*(doiihie)nnv(double)U: 
rprincftfX *\n*); 

frrimlCfe. " < *d matches in an overlap of %i: SE.2f percem similariryVn 
nm.(nm— t)?»: -«Mx. pet): 
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Tabte I fcam»i 
fprintflfx, " < gaps in first sequence: Sd\ napxr 
lf(gtpx){ * 
(void) sprimffoutx. " (%4 %s%g)\ 

ngap** <dna)? "base*: "residue", (ngapx « |y> **:V); 
rprimflfx."%s". outx): 

fprmtftfx. gaps in second sequence: £d\ gapy;- 

(void) sprmrrtouix. " (Sd Ssfcs)". 

ngapy. idna)? ">use": "residue \ (ngapy =«= |)? - : - $ -) ; 
fprintfffx.*%s*. ouu); 

} 

if(dna> 

IS . fprinrflfx. 

"\n< score: %d (match = Sd. mismatch « 5&d. gap penalrv = %d + %d per base)\n" 
smax. DMAT. DM IS. DINSO. DINS I) ^ ' ^ 

else 

iprimtffx. 

i^Pnm PInITJT 17 ^ murU - W Poalry = *d + *d per 
If fendgaps) 

fprinttffx. 

'<endpaps penalized. left eodcap: 5d 5s%s. richi endcap: Sd %s%s\n\ 
hrsigap. (dna>7 "base" ; "residue-, (firsigap m = "| >? - ' v 
lasigap. ldna>' 'base" : "residue-, (bsigap |y» - . "•*-)• 



20 



else 



fprimfffx. " < endgaps noi penaioedta"): 



30 



35 



stalk 


on: 


/■ 


static 


(max; 


/• 


static 


ij[2J: 


/• 


static 


dc[2); 


/• 


static 


nU2|; 


/• 


static 






static char 


•ps[21: 


/•i 




•potfj: 


/•l 


static char 


outtfRP UNE1: 


/• i 


stank char 


sorfP LINE]: 


••* i 



40 

/ 

• prim a] teaman ot described in struct path ppf] 

45 static 

pr_align() 

{ pr_align 
nm /• char « 

50 



55 



far(i = 0. Imax «= 0; i < 2; i++) { 
mi m sii ipnamq nampir[i]); 
if (on > hnax) 

" nn: 



ncfi] - I; 
mTi] - 1: 

^ MfO - ijfil - 0; 

60 l»(n « scqxfil; 

pofi) - outjij; 
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Table I (com*) 

for (nn = tun « 0. more » I: more: ) { 

for(i » more « 0; i < 2: i + +) { 
/• 

* do we have more of this sequence? 



10 more+ + : 

*f (PPfiJ.spc) { /• leading space */ 
^ ^ pptfj.spc--: 

elsetf(siz[i]) { /"hi a gap*/ 

20 die ( /• we're pairing a seq element 

♦/ 

•pofil - *psfi) : 
if (islowert *psft|)) 

*psli| = »>uppen*psfi|): 

pofiK*^: 
psfi| + + : 

/• 

• are we at next gap for this seq? 
30 •/ 

if(m[ij " pplij.xfij(il)) { 
f 

* we need to merge all gaps 

* at this location 

35 mf 

=pp(H.nfijfiI+ + |; 
while (ni(i) » pp(iJ.x[tj|iU) 

sizfil +.pplij.nfu[il+ + l; 

nim+ + : 

} 

} 

if( + +nn -»nlen (| !more SlSl nn) { 
dumpblockO; 
for(i * 0; i < 2; i++> 
po{i) - outfi); 

} " n " a 



Ntanp a block of lines, including numbers, stars: pr_aiign() 



dumpblockO 

{ 



r(i «0:i <ii++) 
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Table 1 fconiM 

(wold) putcf \n\ fit): 
forfi -0:i < 2;i + +){ 

if ComliJ ^*outfi] != ' ' || •(poIiJ>!« •)){ 
if(i ==0) 

nnxnsfi): 
tf(i ~=0&& *outf J |> 

starsO; 
putlinefi): 

tf(i 0&A *ouiil|) 
fprirafrfx, star): 

»<«--i> 

1J rnussCi); 
} 

} 

20 /• 

• pin oui a Dumber line: dumpblockO 
•/ 

static 

mimsiixj 

25 «: '* »*kx in out|| holding seq line V 

char nJine[P_LINEl: 
register i. j; 

register char 'pn, *px. *py: 



30 



35 



45 



55 



for(pn = nime. i = 0: i < Inux+PJPC: i + +.pn++) 
•pn = * 

for(i - ncfix). py = out(U|; *py: py + +. pn-n-) / 

tf (*py ' " II 'py «-'-•) 



ebe{ 

iT(i%10«0||(i=- l&&nc[U)!» !)){ 
j - (i < 0)? -i : i; 
for(px «pn:j;j/= io. px -j 
^px =j*J0 + O': 

if(i < 0) 

> 

else 

*pn m ' 

} 



> 

•pa - 'SO': 
50 ncftx] « i : 



far<pn - niine: *pn; pn++) 
(*oM) puicfpn. fx); 
(*oU)pmc('\n\ ft): 



• pm oui a line (name, (num]. seq. |numl): dumpblockO 



italic 

60 puzlinefix) 

fat ix; 

{ 
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Table t fcotit*) 



for(px - oamcxfix). i m 0: *px&& -px !- V: px++> 

(void) putct*px. fx): 
for(: i < lmax+P_SPC: i + +) 

(void)puta' fx): 

/* these count from I: 
•nif] is current element I from J) 
* nc[| is number at scan of current line 
•/ 

for <px ■ outfixl: m px: px + + ) 

(void) puttCpx&0x7F. fx): 
(void) putci Vn'. fx): 



* put a line ot stars <seqs always in out|0I. om| 1 1): dumpblockO 

static 
starso 

stars 

{ 

register char *p0. *pl. ex. *px; 

if(!»out|0| || Cout|0| «(po|0|) - . • •> II 

!*»w(ii ii rowin • &i«(ponj) „«•• •)) 



} 



ps — star 

for (i «= imax+P SPC: i: i-> 

•px++"«"; 

ror(pO - owfOI, Pi - owll J: *p0 *p|; p0+ + . pi ++) { 
If fisalpharpO) <fc& isalpha(*p] )) { 

^(xbml-pO-'A'I^xbml-pi. A J) { 

CX m 

} 

«teif(!dna&&^dayf*pO-'A-H*pN'AM > 0) 
CX - V; 



} 

etst 

CX - 

•px++ - ex; 

} 

*px+ + - '\n*; 
V - *>0': 
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Table 1 (conV) 

r* 

* strip path or prefix from pn. return ien: pr aJignf) 
•/ 

static 

stripnamefpn) 

char *pn: /* file name (may be path) */ 



{ 



register char *px. •py: 
py «0: 

for(px = pn; *px; p*+ + ) 
if(*px«- T) 

py = px + I: 

(void) strcpytpn. py); 
retuntfsirlen(pn)); 
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Table 1 (com') 

/* — 

• cleanups - cleanup any tmp file 

• getseqO - read in seq. set daa. I en. maxlen 

• gjaliocf) - caJiocO with error checkin 

• readjmpsO - get the good jmps. from imp tile if necessary 

• wriiejmpsO - write a filled array of jmps to a tmp file: nwt > 
V 

linctude "nw.h* 
tfndude <sys/fiie.h> 

char •jname «= Vtnm/bomgXXXXXX': /• tmp file for imps »/ 

FILE 

int clewpo: /. dcaxwp ^ fi|c .y 

long IseekO; 

/• 

• remove any tmp file if we blow 
•/ 

cleamifXi) 

int i: 

{ 

if <fj> 

(void) uniinki jname r. 

exhCi): 

} 

/• 

• read, return ptr to set), set dna. len. maxlen 

• skip lines starting with *<\ or *>' 

• seq in upper or lower case 
V 



getseo/file. len) 

char "file: /• file name •/ 
Int 'len: /• seq ten '/ 



linen024). *p Se q : 
register char *px, *py; 
»* natge. tlen: 

FILE 'fp: 

if <(fp - topen(file.-r w )) «* 0> { 

fprintf(siderr.~%s: can t read %s\n\ proc. file): 
exhtl); 

} 

tlen - nargc - 0: 

e(fgets(line. 1024. n>» { 

if (Tine y || -line — [( . lbie »>.j 



for(px - line: *px !- \n*: px++) 

if (isuppert*px) 1 1 isiowerfpx)) 
tlen++: 

} 

1* «PK*1 - ma^(unsignedHtten+6))) - - 0) { 

fenntfsiderr/%s: mallocO failed to get %d bytes for %s\n\ prog, den+6. ffle): 
exs(l); 

} 

P*tf>] = PseqlU = pseq[2] - pseq[3] - •«>•; 
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Table I (com') 



0 



5 



py - pscq + 4: 
*ten - den: 
rcwind(fp); 

wbue(fgets(hne. 1024. fp)> { 

if(*Une « V || •line -» *<• || „ ■>•) 



forfpx « line: »px != *\n': px++) { 
if (isuppen'px)) 

*Py++ - *px; 
ebeif (is!owen*px)) 

*py+ + =» loupperfpx): 
IT (indexC ATGCU", *(py- i ») 

natgc+ + : 



} 



> 

(void) fclosctfp): 

dna « natcc > ulen/3): 

rcfunwpseq«i-4); 



char 

g_caliocimsg. nx. sz> 

char *msg; 
int nx, su 

{ 



/* program, calling routine */ 
/• number and sire or dements «/ 



•px. •callocO; 



g_olloc 



If ((px e caHoc(( unsigned inx, (unsigned >«ct» 
ir(*msg,{ 

forimffstderr. '%*: B _callocO failed %s (o = *d. sz=%*)\n\ prog. msg. nx. sz): 
exit(j); 



) 

return/ pxt: 



•^get final jmps trom dx| | or tmp file, set pp|]. reset dmax: 

readpnpsO 
{ 



int 
mt 



fd 

siz, iO. tl; 



i. J. xx; 



(▼oid) fclose<0): 

If ((Id - openQname. 0_RDONLY. 0)) < 0) { 

^ ttf ^^ TT ' m%i: caD ' t open() 5&5\n*. prog, jname): 

} 



i): 



) 

f©r(i«i0 = il o0. 

*(»{ 

for (j - dxfdmaxl.ijmp; j > = 0AA dxfdmaxl.jp.xlj] > = xx; j-) 



dmax. xx «len0;;i++){ 



45 



SUBSTITUTE SHEET (RULE 26) 

(0 2003 Copyright Derwe.nl Information Ltd. 



WO 00/53758 



PCT/US00/D5841 



Table j fcont'l 

if (j < 0 && dx|dmax|.oflset <&& fj) { 

(void) IsecWfd, dx/dmax|.ofEset. 0); 

(void) readffd. (char *Vkdxfdnaxhjp. sizeoffstnict jmp)): 

(votd) readud. (char *>&dx|dnuxUfls«. sizeofrdx|dmaxi.o£bct))- 

^ dxfdmaxUjmp - MAXMP.); 

else 

break: 

} 

if{i >» JMPS){ 

frrimftstderr. '%y. too many gaps in alignment", prog): 
cleamip(i): 

HQ > - 0) { 

siz ■ dxfdraax|.jp.n(jj; 
xx - dx(dmax|jp.xyj; 
dmax += siz; 

if<siz<0){ /• gap in second seq*/ 

pp{l).n[il| « siz: 
xx +- sir; 

id - xx -yy -Men! - 1 

•/ 

PPiU.x|il| = xx • dmax + lenl - I 
gapy+ + : 
ngapy -= siz: 
/• ignore MAXCAP when doing endgaps •/ 

six » (-siz < MAXCAP 1 1 endgaps)? -siz : MAXGAP: 

} 

eise if (siz > 0) { /• gap in first seq •/ 
ppfOJ.nfiOl = siz: 
K>l0).xri0J m xx: 
gapx++: 
ngapx + » siz: 
/* ignore MAXGAP when doing endgaps *7 

siz - (siz < MAXGAP j | endgaps)? siz : MAXGAP: 
i0+ + : 

} 

) 

dse 

break: 

} 

/• reverse the order of jxnps 
•/ 

for0 = 0. K)-:j < i0;j++, ©-){ 

'* " fSS ^ PP, ° J,nU1 " «*M«: PPfOKnfiOl - i: 
} ' " PPCOJ^OJ: pp[0J.x(j] - pp{0 W i0): pptOMPOJ « i; 

forQ -0, il-;j < il:j++.U->{ 

! Z SJi^T 2M " PP,I J' n » i1 ^ PPlIWUI - i: 

1 - rtfiUbl; PPiU.xOJ - pp|ii.x[ii|; ppiij-xlii] - i: 

if(fd >«0) 

(void) ctose(fd): 

(void) tmlink(jnxme); 

5-0; 

offer * 0; 

} 
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Table I (com') 

/• 

• write a filled jmp struct offset of [be prev one (if any): nw<) 

w riiejmp5( U) writejmps 
int ix: 

( 

char -mtaempO: 
*(!$){ 

if (mtaempt jname) < 0) { 

fprimftsiderr. can i mkremp() %s\n\ prog, jname): 
cleanxtp(I ): 

if ((fj » fopenOname. = « o> { 

tprimftsderr. *%s: can i write %j\n\ prog, jname): 
exit(l): 

} 

> 

(void) fwrite* (char *)&dx{ix|.jp. sfeeofitfruct jmp). |. Q); 
(void) rwritettcfaar *l&dx|ix|. offset. siMof(dxfU|.offset). I. Q); 
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PRO XXXXXXXXXXXXXXX (Length - 15 ammo tads) 

5 Comparison Protein XXXXXYYYYYYY (Length « 12 a mmo acids) 

% amino acid sequence identity « 

(the number of identically matching amino acid residues between the two polypeptide sequences as determined by 
1 0 ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) - 

5 divided by 15 - 33.3% 
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Table 3 

™° XXXXXXXXXX (Length = 10 amino acids) 

Companson Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

% amino acid sequence identity » 

(the ntanber of identical matching amino acid residues between the two polypeptide sequences as determined by 
AUGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) * 

5 divided by 10 - 50% 
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Tabte4 

PRODNA NNNNNNNNNKNNNN (Length - 14 nucteoiidcs) 

ComptrisoD DNA NNNNNNL U LLLLLU (Length - 16 nucleotides) 

5 

% nucleic acid sequence identity - 

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by AUGN-2) 
divided by (the total number of nucleotides of the PRO-DNA nucleic acki sequence) - 

10 

6 divided by J4 «= 42.9% 
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Table 5 

PRO-DNA NNNNNNNNNNNN (LenglD . „ ffiideo ^ ) 

Comparison DNA NKNNLLLW (Lcoglh « o n**^) 

% nucleic acid sequence idcmiiy « 



(the number of identically maictamc nucleotides henvw*. rh« 

^ " DenvcCT tte acid sequences as detenmned by AUCN-2) 

divided by (the total number of nucleotides of the PRO-DNA nucJeic acid sequence, - 



4 divided by 12 = 33.3% 
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IL Compositions and Methods of the Invention 

I. Preparation of the PRO polypeptides of the invention 

The present invention provides newly identified and isolated nucleotide sequences encoding the 
polypeptides in the present application as PRO polypeptides. In particular. cDNAs encoding various PRO 
5 polypeptides have been identified and isolated, as disclosed in further detail in the Examples below. It is noted 
that proteins produced in separate expression rounds may be given different PRO lumbers but the UNQ 
number is unique for any given DNA and the encoded protein, and will not be changed. However, for the sake 
of simplicity, in the present specification the protein encoded by the full length native nucleic acid molecules 
disclosed herein as well as all further native homologues and variants included in the foregoing definition of 
10 PRO. will be referred to as "PRO/number" or even "PRO*, regardless of their origin or mode of preparation. 

In particular. cDNA encoding a PRO200, PRO204. PR0212. PR0216, PR0226. PRO240. PR0235. 
PR0245. PROI72. PR0273. PR0272. PR0332. PR0526, PRO701. PR0361. PR0362, PR0363. PR0364, 
PR0356. PR053I. PR0533, PRO1083. PR0865. PRO770. PR0769. PR0788. PROIU4. PROI007. 
PR01I84. PROI031. PR01346. PROl 155. PRO1250. PROI312. PROl 192, PR01246. PR01283, PROl 195. 
15 PR01343. PR01418. PR01387. PROI410. PR019I7. PR01868. PRO205. PR02I. PR0269. PR0344. 
PR0333. PR038I. PRO720. PR0R66. PRO840. PR0982. PRQ836. PROH59. PR01358. PR01325. 
PR0133S. PROI434. PR04333. PRO4302. PRO4430 and PR05727 polypeptide < corresponding io UNQI74. 
UNQI78. UNQ186. UNQ190. UNQ200. UNQ214, UNQ209. UNQ2I9. UNQI46. UNQ240. UNQ239. 
UN0293. UNQ330. UNQ365. UNQ3I6. UNQ317, UNQ318. UNQ319. UNQ313. UNQ332. UNQ334. 
20 UNQ540. UNQ434. UNQ408. UNQ407. UNQ430. UNQ557. UNQ49I, UNQ598, UNQ5I6. UNQ70I. 
UNQ585. UNQ633. UNQ678. LTNQ606. UNQ630. LTNQ653. UNQ608, LTNQ698, UNQ732. UNQ722, 
UNQ728. UNQ900. UNQ859, UNQ 1 79. UNQ21, UNQ236, UNQ303, UNQ294. UNQ322. UNQ388, 
UNQ435. UNQ433, UNQ483, UNQ545. UNQ589. UNQ707. UNQ685, UNQ693, UNQ739. UNQI888, 
UNQ1866. UNQ1947 and UNQ2448. respectively) has been identified and isolated, as disclosed in further 
25 detail in ihc Examples below. 

In even greater particularity, the present specification describes the cDNAs DNA29I01-1276. 
DNA3087LU57. DNA3094M 134. DNA330S7-I15S. DN A33460- 1 1 66. DNA34387-U38, DNA35558- 
1167. DNA35638-H41, DNA359I6-1 161. DNA39523-1192, DNA40620-1 183. DNA40982-1235, 
DNA44184-I3I9, DNA44205-I285. DNA454 10-1250. DNA45416-1251, DNA45419-1252. DNA47365- 
30 1206. DNA4747CM130. DNA483 14-1320. DNA49435-1219, DNA5092 M458, DNA53974-140I. 
DNA54228-1366. DNA5423 M366, DNA56405-1357, DNA57033- 1403, DNA57690-1374, DNA59220- 
1514, DNA59294-1381. DNA59776-1600, DNA59849-I504, DNA60775-I532, DNA61 873-1 574. 
DNA62814-I52L DNA64885-1529. DNA65404-1551, DNA654 1 2-1523, DNA66675-1587, DNA68864- 
1629. DNA68872-1620, DNA68874-I622, DNA76400-2528. DNA77624-2515, DNA30868-1 156, 
35 DNA36638-1056, DNA38260-U80, DNA4059M242. DNA41374-1312, DNA44194-13I7, DNA53517- 
1366, DNA53971-1359, DNA53987-I438, DNA57700- 1408, DNA59620-I463, DNA60627-1508, 
DNA64890-16I2, DNA66659-1593, DNA66667-1596, DNA688 18-2536, DNA8421 0-2576, DNA92218- 
2554. DNA96878-2626, DNA98853- 1 739 which encode native sequence PRO200, PRO204, PR0212, 
PR0216, PR0226, PRO240, PR0235, PR0245. PR0172, PR0273, PR0272. PR0332, PR0526, PRO701, 
40 PR0361. PR0362, PR0363, PR0364, PR0356, PR053I. PR0533, PRO1083, PR0865, PRO770, PR0769, 
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PR0788. FR01I14. PRO1007. PR01184. PROI031. PR01346. PRO. 155. PRO1250. PR0131Z PR01192, 
PR01246. PR01283, PROU95. PR01343, PR01418. PR01387, PRO1410. PR01917. PR01868. PRO205, 
PR021. PR0269, PR0344. PR0333. PR0381. PRO720, PROS66. PRO840. PR0982, PR0836 PROU59 
PR01358. PR01325. PROI338. PR01434. PR04333. PRO4302, PRO4430 and PR05727 poiypcptidcs! 
respectively. 

As disclosed in the Examples below, various cDNA clones have been deposited with (he ATCC. the 
actual nucleotide sequence of those clones can readily be determined by the skilled artisan by sequencing of the 
deposited clone using routine methods in the aa It is understood that the sequence of the deposit contains the 
correct sequence in the event of a discrepancy between the deposited sequence and tliose disdosrf Tbc 
predicted amino actd sequence can be determined from the nucleotide sequence using routine skill. For me 
PRO polypeptides and encoding nucleic acids described herein. Applicants have identified what is believed to 
be (he reading frame best identifiable with the sequence information available at the time. 

B. PRO Polypeptide Variants 

In addition to the full-length native sequence PRO polypeptides described herein, i, is contemplated 
that PRO varum* can be prepared. PRO vanants can be prepared by introducing appropriate nucleoride 
changes into the PRO DNA. and-or by symtasis of the desired PRO polypeptide, TTiose skilled in the an will 
appreciate that ammo acid changes may alter post-translational processes of the PRO. such as changing the 
number or position of glycosylate sites or altering the membrane anchoring characteristics. 

Variations in the native full-length PRO sequence or in various domains of the PRO described herein, 
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative 
mutations set form, for instance, in US. Patent No. 5 J64.M4. Variations may be a substitution, deletion or 
insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the 
PRO as compared with the native sequence PRO. Optionally die variation is by substitution of at least one 
amino acid with any other ammo acid in one or more of the domains of the PRO. Guidance in determining 
which amino acid residue may be insened. substituted or deleted without advaselv affecting the desired 
acuvity may be found by comparmg the sequence of the PRO with that of homologous known protein 
molecules and minimizmg .he number of amino acid sequence changes made in region, of high homology. 
Ammo acid substitutions can be the result of replacing one amino acid with another amino acid having similar 
structural and/or chemical properties, such as the replacement of a leucine with a serine. Lc. conservative 
ammo acid replacements. Insertions or deletions may optionally be in the range of about I to 5 amino acids. 
The variation allowed may be determined by systematically making insertions, deletions or substitutions of 
ammo acids in the sequence and testing the resulting variants for activity exhibited by the full-length or mature 
native sequence. 

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-termmus 
or Ctennmus. or may lack internal residues, for example, when compared with a full length native protein. 
Catam fragments lack ammo acid residues that are not essential for a desired biological activity of the PRO 



PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized An alternative approach involves generating PRO fragments by 
40 enzymatic digestion. e*. by treatmg the proiem with an enzyme known to cleave protems at sites defined by 
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panicular amino acid residues, or by digesting the DNA wiih suitable restriction enzymes and isolating die 
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding 
a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired 
termini of the DNA fragment are employed at the 5' and 3* primers tn the PCR. Preferably, PRO polypeptide 
fragments share at least one biological and/or immunological activity with the native PRO polypeptide 
disclosed herein. 

In particular embodiments, conservative substitutions of interest are shown in Table 6 under the 
heading of preferred substitutions. If such substituuons result in a change in biological activity, then more 
substantial changes, denominated exemplary substitutions in Table 6. or as farther described below in reference 
to ammo acid classes, are introduced and the products screened. 

Table 6 



Original 
Residue 



Exemplary 
Substitutions 



Preferred 
Substitutions 



15 



20 



25 



30 



35 



Ala (A) 
Arg(R) 
Asn(N) 
Asp(D) 
Cys(C) 

Gln(O) 
GIu (E) 
Gly(G) 
His(H) 

nc(o 

Leu a) 

Lys(K) 

Met(M) 

Phe(F) 

Pro(P) 

Ser(S) 

ThrfT) 

Trp(W) 

TyrfY) 
Vnl(V) 



val: leu; ile 

lys; gin: asn 

gin: his; lys: arg 

glu 

ser 

asn 

asp 

pro: ala 

asn: gin: lys: arg 

leu: val; met; ala: phe: 

norleucine 

norleucinc: ile: val: 

met; ala: phc 

arg; gin; asn 

leu; phe; ile 

leu; val; ile: ala; ryr 

ala 

thr 

ser 

ryr, phe 

tip; phe; thr; ser 
ile; leu; met; phe; 
ala; norleucine 



val 
lys 
gin 
glu 
ser 
asn 
asp 
ala 



leu 

tie 

arg 
leu 
leu 
ala 
thr 
ser 
ryr 
phe 

leu 



40 



Substantial modifications in function or immunological identity of the 
accomplished by selecting substitutions that differ significantly in their effect on 



polypeptide are 
(a) the s 
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of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical confonnanon. (b) 
the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side cham. Naturally 
occurring residues are divided into groups based on common side-chain properties: 

(1 ) hydrophobic: norleucine. met. ala. val. leu. ile; 

(2) neutral hydrophilic: cys. ser. thr; 

(3) acidic* asp, glu; 

(4) basic: asn, gin, his. lys, arg; 

(5) residues that influence chain orientation: gly, pro: and 

(6) aromatic: up. tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another 
class. Such substituted residues also may be mtroduced into the conservative substitution sites or. more 
preferably, into the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide-mediated (site- 
directedl mutagenesis, alanine scannmg, and PCR mutagenesis. Site-directed mutagenesis [Carter a a!.. .Vucl. 
MidsRcs.. 13:4331 (1986): Zollcrwu/.. XucL Acids Res.. ]0:6487 (1987)). cassette mutagenesis [Weils era/.. 
Gene. 34:315 (1985)). rcsmction selection mutagenesis [Wells et at.. Philos. Trans. R. Soc. London SerA. 
317:415 (I986)J or other known techniques can be performed on the cloned DNA to produce the PRO variant 
variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. 
Such amino acids include alanine, glycine, serme, and cysteine. Alanine is typically a preferred .-canning 
amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to 
alter the mam-cham conformation of the variant (Cunningham and Wells. Science. 244: 1081-1085 (1989)]. 
Alanine is also typically preferred because it is the most common amino acid Further, it is frequently found m 
both buried and exposed positions (Creighton. The Proteins. (W.H. Freeman & Co„ N.Y.): Cbothia. J. Mot. 
Biol.. 150:1 (1976)]. If alanine substituuon does not yield adequate amounts of vanant. an isoteric amino acid 
can be used 

C. Modifications of PRO 

Covalent modifications of PRO polypeptides are included within the scope of this invention. One 
type of covalent modification includes reacting targeted amino acid residues of a PRO polypeptide with an 
orgamc derrvmixmg agm mat is capabte 

of the PRO. Derivadanon with bimnctional agents is useful for instance, for crosslink^ PRO to a water- 
insoluble support matrix or surface for use in the method for purifying ami-PRO antibodies, and vice-versa. 
Commonly used blinking agents include, e.g.. I.l-bi^diazo^l^-phaiylemane. glutaraldeuyde, N- 
hydroxysucriiumide esters, for example, esters with 4-azidostdicyltc acid, homobifunctional irmdoesters. 
including disucemimidyl esters such as 3J'-ditmobis(succmimidylpropionate). bt&ncrional maleimides such as 
bis-N-mdeimido-U-octane and agents such as memyl-3-[(p-azidopbcnyl)diudolpropioimidate. 

Other modifications include deamidation of ghaaminyl and asparaginyl residues to the corresponding 
glutamyl and a-rpanyi residues, respectively, hydroxyUtion of proline and lysine, phosphorylation of hydroxy! 
groups of seryl or threonyi residues, methylation of me a-amino groups of lysine, argmme. and bistidine side 
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chains [I.E. Creighton. Proteins: Structure and Molecular Properties. W.H. Freeman & Co.. San Francisco, 
pp. 79-86(1983)1, acerylaiion of the N-ierminai amine, and amidation of any C-cerminaJ carboxyl group. 

Another rype of covalem modification of the PRO polypeptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native 
glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties 
found in native sequence PRO polypeptide (either by removing the underlying glycosylation site or by deleting 
the glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are 
not present in the native sequence PRO. In addition, the phrase includes qualitative changes in the 
glycosylation of the native proteins, involving a change in the nature and proportions of the various 
carbohydrate moieties present. 

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino 
acid sequence. The alteration may be made, for example, by the addition of. or substitution by, one or more 
serine or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino \ 
acid sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the PRO polypeptide at preselected bases such that codons arc generated that will translate into the 
desired ammo acids. 

Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art. e.g.. 
in WO 87,105330 published 1 1 September 1987. and in Aplin and Wriston, CRC Cra. Rev. Biochem.. pp. 259- 
306(1981). 

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically 
or cnzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets 
for glycosylation. Chemical deglycosylation techniques are known in the an and described- for instance, by 
Hakimuddin. et aL Arch. Biochem. Biophys.. 259:52 fI987) and by Edge ct a!.. Anal. Biochem.. 118:131 
fl 98 1). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety 
of endo- and exo-glycosidases as described by Thotakura et aL Meth. EnzymoL. 238:350 f 1987). 

Another rype of covalem modification of PRO comprises linking the PRO polypeptide to one of a 
variety of nonproteinaceous polymers, *g m , polyethylene glycol (PEG), polypropylene glycol, or 
polyoxyalkylencs. in the manner set forth in U.S. Patent Nos. 4,640.835; 4,496,689; 4.301.144; 4.670,417: 
4/791,192 or 4,179,337. 

The PRO polypeptides may also be modified in a way to form a chimeric molecule comprising the 
invention polypeptide fused to another, heterologous polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide 
which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally 
placed at the amino- or carboxyl- terminus of the PRO. The presence of such epitope-tagged forms of the PRO 
polypeptide can be detected using an antibody against the tag polypeptide. Also, provision of the epitope tag 
enables the PRO to be readily purified by affinity purification using an anti-tag antibody or another rype of 
affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective antibodies are well 
known in the an. Examples include poly-histidine (poly-his) or poly-histidine-giycine (poly-bis-gly) tags; the 
flu HA tag polypeptide and its antibody 12CA5 [Field et aL, Moi Cell Biol., 8:2159-2165 (1988)); the c-myc 
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tag and the 8F9. 3C7. 6E10. 04. B7 and 9E10 antibodies thereto (Evan et aL Molecular and Cellular Biology, 
5:3610-3616 (1985)1; and the Herpes Simplex virus glycoprotem O (gD) tag and its antibody [Paborsky et at. 
Protein Engineering. 3(6):547-553 (1990)J. Other tag polypeptides include the Flag-peptide (Hopp er aL 
BioTechnology. 6:1204-1210 (I988)J; the KT3 epitope pepude (Martin et aL Science, 255: 192- 194 (1992)]; an 
o-tubulin epitope peptide [Skinner et aL J. Biol. Chem.. 266:15163-15166 (1991)]; and the 17 gene 10 
protein peptide tag [Lutz-Freyermuth er al.. Proc Natl. Acad. Sci USA. 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent fonn of the chimeric 
molecule (also referred «o as an ••immuttwdhesm"). such a fiision could be to the Fc region of an IgG molecule. 
The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) 
form of an invention polypeptide in place of at least one variable region within an Ig molecule. In a 
particularly preferred embodiment, the immunoglobulin fusion includes the hinge. CH2 and CH3. or the hinge. 
CHI. CH2 and CH3 regions of an IgG 1 molecule. For the production of immunoglobulin fusions see also US 
Patent No. 5.428.130 issued June 27. 1995. 
•5 D. Preparation of PRO 

The dcscnpt.on below relates to primarily to production of PRO by otituring cells transformed or 
transfected with a vector containing PRO nucleic acid. It is. of course, contemplated that alternative methods, 
which am well known in .be art. may be employed to prepare PRO. For instance, the PRO sequence, or 
portions thereof, may be produced by direct peptide symhests using solid-phase techniques (see. e.g.. Stewart 
e, al. Solid-Phase Peptide Synthesis. W.H. Freeman Co. San Francisco. CA (1969): Merrificld. J. Am. Chcm. 
Soc 85: 2149-2154 (1963)]. In Mtro proton synthesis may be performed using manual techniques or by 
automation. Automated synmesis may be accomplished, for instance, using an Applied Biosystems Peptide 
Synthesizer (Foster City. CA) using the manufacturer s instructions. Various portions of the PRO may be 
chemically synthesized separately and combined using chemical or enzymatic methods to produce the full- 
25 length PRO. 

'• Isolation of D NA F.ncndinP the PRO Polypeptide^! 
DNA encoding the PRO may be obtamed from a cDNA library prepared from tissue believed to 
possess the polypeptide mRNA and to express it a. a detectable level. Accoidtngly. human PRO DNA can be 
conveniently obtained from a cDNA library prepared from human tissue, such as described in the Examples. 
The PRO-encoding gene may also be obtained from a genomic library, oligonucleotide synthesis, or other 
known synthetic procedures (e.g.. automated nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PRO polypeptide or oligonucleotides 
of at leas, about 20-80 bases) designed to identify the gene of interest or the protein encoded by it Screening 
the cDNA or genomic library with the selected probe may be conducted using standard procedures, such as 
described in Sambrook « al.. Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor 
Laboratory Press, 1989). An alternative means to isolate the gene encoding the PRO polypeptide is to use PCR 
methodology [Sambrook et aL supra-. Dieffenbach et aL PCR Primer: A Laboratory Manual (Cold Spring 
Harbor Laboratory Press, 1995)J. 

The Examples below describe techniques for screening a cDNA library. The oligonucleotide 
sequences selected as probes should be of sufficient length and sufficiently unambiguous that false positives 
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ore minimized. The oligonucleotide is preferably labeled such that ii can be detected upon hybridization to 
DNA in the library being screened. Methods of labeling are well known in the an, and include the use of 
radiolabels like "P-labeled ATP, biotmylation or enzyme labeling. Hybridization conditions, including 
moderate stringency and high stringency, are provided in Sambrook et aL supra. 
5 Sequences identified in such library screening methods "can be compared and aligned to other known 

sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the art and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or 
10 genomic libraries using the deduced amino acid sequence disclosed herein for the first time. and. if necessary, 
using conventional primer extension procedures as described in Sambrook et al.. supra, to detect precursors 
and processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 
2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described herein for 

1 5 production of the PRO polypeptides and cultured in conventional nutrient media modified as appropriate for 
inducing promoters, selecting trans form an is. or amplifying the genes encoding the desired sequences. The 
culture conditions, such as media, temperature. pH and the like, can be selected by the skilled artisan without 
undue expenmemauon. In general, principles, protocols, and practical techniques for maximizing the 
productivity of cell cultures can be found in Mammalian Cell Biotechnology: A Practical Approach. M. 

20 Butler, cd. (IRL Press. 1 991 ) and Sambrook et aL supra. 

Methods of transfection are known to the ordinarily skilled artisan, for example. CaPC>4 and 
electro poration. Depending on the hosi cell used, transformation is performed using standard techniques 
appropriate to such cells. The calcium treatment employing calcium chloride, as described in Sambrook et aL, 
supra, or eiectroporation is generally used for prokaryotes or other cells that contain substantial cell-wall 

25 barriers. Infection with Agrobactcrium tumetaciens is used for trans formation of certain plant cells, as 
described by Shaw a aL Gene. 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian 
ceils without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb. Urology, 
52:456-457 (1978) can be employed- General aspects of mammalian cell host system transformauons have 
been described in U.S. Patent No. 4.399.216. Transformations into yeast are typically carried out according to 

30 the method of Van Solingen et aL J. Baa.. 130:946 (1977) and Hsiao et aL Proc. Natl. Acad. Sci. (USA), 
76:3829 (1979). However, other methods for introducing DNA into ceils, such as by nuclear microinjection, 
electro porauon. bacterial protoplast fusion with intact cells, or polycanons. e.g.. polybrene, polyornimine. may 
also be used. For various techniques for trims forming mammalian cells, see fCeown et aL Methods in 
Eraymology, 185:527-537 (1990) and Mansour et aL Nature. 336:348-352 (1988). 

35 Suitable host ceils for cloning or expressing the DNA in the vectors herein include prokaryote. yeast 

or higher eukaryote cells. Suitable prokaryotes include but are not limited to eu bacteria, such as Gram- 
negative or Gram-positive organisms, for example, Enterobacteriaceae such as E. calL Various E. coli strains 
are publicly available, such as £. coli K12 strain MM294 (ATCC 3 1 .446); E. coli X1776 (ATCC 3 1,537); £. 
coli strain W31 10 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include 

40 Enterobacteriaceae such as Escherichia* eg.. E. coli, Emerobacter. Envinia. Klebsiella. Proteus, Salmonella, 
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e.g.. Salmonella ryphmurium. Serrano. eg.. Serratia marcescans. and Shigella, as well as Bacilli such as A 
subtilis and B. lieheniformis (e.g.. B. lieheniformis 41P disclosed in DD 266.710 published 12 April 1989). 
Pseudomonas such as P. aeruginosa, and Streptomyces. These examples ate illustrative rather than limiting. 
Strain W3110 is one particularly preferred host or parent host because it is a common host strain for 
recombinaut DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic 
enzymes. For example, strain W3 1 10 may be modified to effect a genetic mutation in the genes encoding 
proteins endogenous to the host, with examples of such hosts including £ eoti W31 10 strain 1A2. which has 
the complete genotype ,onA : £ col, W31 10 stratn 9E4. which has the complete genotype tonA ptrS: £ coli 
W31 10 stratn 27C7 (ATCC 55.244). which has the complete genotype tonA ptrS phoA EIS (argF.lac)l69 
dcgP ompTkan: E. coli W31 10 strain 37D6. which has the complete genotype tonA ptrj phoA EIS (argf- 
lacim degP ompT rbs7 ilvG kan : E coli W3 1 10 straw 40B4. which is strain 37D6 with a non-kanamycin 
resistant degP deletion mutation: and an £. coli strain having mutant periplastic protease disclosed in U.S. 
Patent No. 4.946.783 issued 7 August 1990. Alternatively, in , mx> methods of clotting, e.g.. PCR or other 
nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes. cukaryotic microbes such as filamentous fungi or yeast are suitable cloning 
or expression hosts for PRO-encoding vectors. Saccharomyces ccrevisiae is a commonly used lower 
eukaryottc host microorganism. Others .nelude Schizosaccharomyccs pon.be ( Beach and Nurse. Namre. 290- 
140 [I98IJ: EP 139.383 published 2 May 1985): Kluyi-eromyccs hosts (U.S. Patent No. 4.943.529: Fleer e, aL 
Biotechnology; 9:968-975 (1991 )) such as. eg.. A', lacm fMW98-8C. CBS683. CBS4574: Louvencourt et al 
J. BaaerioL I54<2):737.742 (.983)). Al fragilis (ATCC 12.424). K. bulgaricus (ATCC 16.045). K. 
~ickeramii (ATCC 24.178). K. ua/m (ATCC 56.500). K. drosophilarum (ATCC 36.906: Van den Berg « al.. 
Biotechnology, 8:135 (1990)). Al thermogram, and K. marxianus: yarrow* (EP 40Z226): Pichia postoris 
(EP 183.070: Sreekrishru et al.. J. Basic Microbiol.. 28:265-278 [1988]); Candida: Trichoderma reesia (EP 
244.234): Neurosporacrassa (Case etal.. Proc. Na„. Acad. Sci. USA. 76:5259-5263 [19791): ScbwannWcer 
such as Schwamu^res occidental* (EP 394.538 published 3 1 October 1990): and filamentous mnei such as. 
e.g.. Nettrospora. Pcnicillium. Tohpocladium ,WO 91/00357 published 10 January 1991). and Aspergilim 
hosts such as A. nidulans (Ballanee ct al.. Biochem. Biophys. Res. Common.. 112:284-289 [1983]: Tilbum et 
al. Gene. 26:205-221 [1983J: Velum a a,.. Proc. Nat,. Acad. Sci USA. U: 1470-1474 [1984]) am! A. niger 
(Kelly and Hynes. EMBOJ.. 4:475-479 [.985]). Memylotrop.c yeasts are suitable herein and include, but are 
not limited to. yeast capable of growth on memanol selecmd from the genera consisting of Hansenula. 
Candida. Kloeckera. Pichia. SaccAaramyves. Torulopsis. and Rhodotontla. A list of specific species that are 
exemplary of this class of ye** may be found in C. Anthony, 77,e Biochemistry ofMeihylotrophs, 269 ( 1 982). 

Suitable host cell, for the expression of glycosylated PRO polypeptides are derived from multicellular 
organisms. Examples of invertebrate cells include insect cells such as Drosophtla S2 and Spodoptera Sf9, as 
well as plant ceUs. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and 
COS cells. More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC 
CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, 
Graham et al. J. Gen ViroL 36:59 (1977)); Chinese hamster ovary cellsZ-DHFR (CHO, Urlaub and Chasm. 
Proc Nad. Acad. Set USA 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod. 23:243-251 
(1980)); human lung cells (W138. ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse 
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mammary tumor (MMT 060562. ATCC CCLSl). The selection of the appropriate host cell is deemed to be 
within the skill in the art 

3. Selection and Use of a Replicable Vector 
Trie nucleic acid (e.g.. cDNA or genomic DNA) encoding the PRO polypeptides may be inserted into 
5 a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly 
available. The vector may. for example, be in the form of a plasmid. cosmid. viral particle, phagemid or phage. 
The appropriate nucleic acid sequence may be inserted into the vector by a variety of procedures. In general, 
DNA is inserted into an appropriate restriction endonuciease site(s) using techniques known in the art. Vector 
components generally include, but are not limited to. one or more of a signal sequence, an origin of replication, 
10 one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. 
Construction of suitable vectors containing one or more of these components employs standard ligation 
techniques which are known to the skilled artisan. 

The PRO may be produced recombinant^ not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
15 at the N-termmus of the mature protein or polypeptide. In general, the signal sequence may be a component of 
the vector, or it may be a pan of the PRO-encoding DNA that is inserted into the vector. The signal sequence 
may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, 
penicillinase. 1pp. or heat-stabie entcrotoxin II leaders. For yeast secretion the signal sequence may be, e.g.. 
the yeast invertase leader, alpha factor leader (including Succharomyvcs and Kluyveromyces a-foctor leaders, 
the latter described in U.S. Patent No. 5.010J82). or acid phosphatase leader, the C albicans glucoamylase 
leader (EP 362.179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November 
1990. in mammalian cell expression, mammalian signal sequences may be used to direct secretion of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral 
secretory leaders. 

25 Both expression and cloning vectors contain a nucleic acid sequence that enables the vecior to 

replicate in one or more selected host cells. Such sequences are well known ibr a variety of bacteria, yeast, and 
viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2u 
plasmid origin is suitable for yeast, and various viral origins (SV40 t polyoma, adenovirus. VSV or BPV) are 
useful for cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable 
marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins. e.g. 
ampicillin. neomycin, methotrexate, or tetracycline, fb) complement auxotrophic deficiencies, or (c) supply 
critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those chat enable the identification 
35 of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An 
appropriate host ceil when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, 
prepared and propagated as described by Urlaub et al., Proc. Nad. Acad ScL USA, 77:4216 (1980). A suitable 
selection gene for use in yeast is the irpl gene present in the yeast plasmid YRp7 [Stinchcomb et al. Nature, 
28239 (1979); Kingsman et aL, Gene. 7:141 (1979); Tschemper et al.. Gene, 10:157 (1980)]. The irpl gene 
provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, 
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ATCC No. 44076 or PEP4- I (Jones. Genetics. 85: 12 (1977)]. 

Expression and cloning vectors usually contain a promoter operebly linked to the PRO-encoding 
nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are 
well known. Promoters suitable for use with prokaryotic hosts include the (i- lactamase and lactose promoter 
systems (Chang et aL Nature. 275:615 (1978); Goeddel et aL Nature. 281:544 (1979)], alkaline rAospaatase. 
a tryptophan (up) promoter system (Goeddel A'udetc Acids Res,. 8:4057 (1980); EP 36.776], and hybrid 
promoters such as the tac promoter (deBoer et at.. Proc. Nad. Acad. Sci. USA. 80:21-25 (1983)). Promoters for 
use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked to the DNA 
encoding PRO. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglycerate kinase [Hitzeman et ad.. J. Biol. Chem.. 255:2073 (1980)] or other glycolytic enzymes [Hess 
et al., J. Adv. Enzyme Reg., 7:149 (1968); Holland. Biochemistry. \TA900 (1978)], such as enolasc. 
glyceraldehyde-3-phosphate dehydrogenase, hexokinase. pyruvate decarboxylase, phospho-fructokmase, 
glucose-6-phosphate isomerase. 3-phosphoglycerate mutase. pyruvate kinase, triosephosphate isomerase. 
phosphoglucose isomerase. and glucokinase. 

Other yeast promoters, which arc inducible promoters having the additional advantage of transcription 
controlled by growth condiuons. are the promoter regions for alcohol dehydrogenase 2. isocytochrome C. acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehydc-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast expression are further described in EP 73,657. 

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 
obtained from the genomes of viruses such as polyoma vims, fowlpox virus (UK 121 1.504 published 5 Jury 
1989). adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40). from heterologous mammalian promoters, e.g.. the 
actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are 
compatible wich the host cell systems. 

Transcription of a DNA encoding the PRO polypepude by higher eukaryotes may be increased by 
iriserting an enhancer sequence into the vector. Enhancers are cis-actmg elements of DNA. usually about from 
10 to 300 bp. that act on a promoter to increase its Description. Many enhancer sequences are now known 
from mammalian genes (globin. elasiase. albumin, a- fetoprotein, and insulin). Typically, however, one will 
use an enhancer from a eukaryoric cell virus. Examples include the SV40 enhancer cm the late side of the 
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma e nhance r on the 
late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a 
position 5' or 3' to the coding sequence of the PRO polypeptide, but is preferably located at a site 5* from the 
promoter. 

Expression vectors used in eukaryoric host cells (yeast, fungi, insect, plant animal, human, or 
nucleated ceils from other multicellular organisms) will also contain sequences necessary for the termination of 
transcription and for stabilizing the mRNA. Such sequences are commonly available from the 9 and, 
occasionaily 3', untranslated regions of eukaryoric or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylared fragments in the untranslated portion of the mRNA encoding PRO. 
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Still other methods, vectors, and host cells suitable for adaptation to the synthesis of the PRO 
polypeptide in recombinant vertebrate cell culture are described in Gething ct aL Nature, 293:620-625 (1981); 
Mantei et aL, Nature. 28I:4<M6 (1979); EP 1 17.060: and EP 1 17,058. 
4. Detecting Gene Expression 
5 Gene expression may be measured in a sample directly, for example, by conventional Southern 

blotting. Northern blotting to quanntate the transcription of mRNA (Thomas. Proc Nad. Acad ScL USA, 
77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an appropria tely labeled 
probe, based on the sequences provided herein. Alternatively, antibodies may be employed that can recognize 
specific duplexes, including DNA duplexes. RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein 
1 0 duplexes. The antibodies in turn may be labeled and the assay may be earned out where the duplex is bound to 
a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex 
can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quanntate 

15 directly the expression of gene product. Antibodies useful for immunohistochemical staining and'or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal Conveniently, 
die antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based 
on the DNA sequences provided herein or against exogenous sequence fused to DNA encoding the PRO 
polypeptide and encoding a specific antibody epitope. 

20 5. Purification of Polypeptide 

Forms of the PRO may be recovered from culture medium or from host cell lysates. If membrane- 
bound, tt can be released from the membrane using a suitable detergent solution (e.g.. Triton'-X 100) or by 
enzymatic cleavage. Cells employed in expression of the PRO polypeptide can be disrupted by various 
physical or chemical means, such as freeze -thaw cycling, sonication. mechanical disruption, or cell lysing 

25 agents. 

it may be desired to punty PRO polypeptide from recombinant cell proteins or polypepudes. The 
following procedures arc exemplary of suitable purification procedures: by fractionation on an ion-exchange 
column: ethanol precipitation: reverse phase HPLC: chromatography on silica or on a cation-exchange resin 
such as DEAE; cfaromatofocusing; SDS-PAGE; ammonium sulfate precipitation: gel filtration using, for 

30 example. Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal 
chelating columns to bind epitope-tagged forms of the PRO polypepti de. Various methods of protein 
purification may be employed and such methods are known in the art and described for example in Deutscher. 
Methods in £nzymoiogy % IZ2 (1990): Scopes, Protein Purification: Principles and Practice. Springer- Verlag, 
New York (1982). The purification step(s) selected will depend, for example, on the nature of the production 

3 5 process used and the particular PRO polypeptide produced. 
E. Tissue Distribution 

The location of tissues expressing the PRO can be identified by detennining mRNA expression in 
various human tissues. The location of such genes provides information about which tissues are most likely to 
be affected by the stimu la t i n g and inhibiting activities of the PRO polypeptides. The location of a gene in a 
40 specific tissue also provides sample tissue for the activity blocking assays discussed below. 
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As noted before, gene expression in various tissues may be measured by conventional Southern 
blotting. Northern blotting to quanritate the transcription of mRNA (Thomas. Proc. Nad. Acad. ScL USA. 
77:5201-5205 [1980]). dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled 
probe, based on the sequences provided herein. Alternatively, antibodies may be employed mat can recognize 
5 specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protexn 
duplexes. 

Gene expression in various tissues, alternatively, may be measured by immunological methods, such 
as immunohisiochemical staining of tissue sections and assay of cell culture or body fluids, to quantimte 
directly the expression of gene product. Antibodies useful for trnmunohisiochemical staining and/or assay of 
) sample fluids may be either monoclonal or polyclonal and may be prepared in any mammal Conveniently, 
the antibodies may be prepared against a native sequence of a PRO polypeptide or against a synthetic peptide 
based on me DNA sequences encoding the PRO polypeptide or against an exogenous sequence fused to a DNA 
encoding a PRO polypeptide and encoding a specific antibody epitope. General techniques for generating 
antibodies, and special protocols for Northern blotting and in situ hybridization are provided below. 
F. Antibody Binding Studies 

The activiry of the PRO polypeptides can be further verified by antibody binding studies, in which the 
ability or anti-PRO200. anti-PRO204. ann-PR02l2. anti-PR0216. anti-PR0226, ami-PRO240. anti-PR0235. 
anri-PR0245. anti-PR0172, anti-PR0273. anti-PR0272, anti-PR0332, anti-PR0526. anti-PRO70l. anti- 
PR036I. ami-PR0362. anti-PR0363. anti-PR0364. anti-PR0356. onti-PR053I. anti-PR0533. anti-PRO1083, 
anti-PR0865. anti-PRO770, anu-PR0769. anti-PR0788. anti-PROU14. anri-PRO1007. anti-PROl 184. anti- 
PRO1031. ami-PR01346. anti-PROl 155, anti-PRO1250. anti-PROI3l2. anti-PROI 192. anti-PROt246. am> 
PR01283. anti-PROI 195. anti-PR01343. anti-PR01418. anti-PRO!387. anti-PROI4IO. anti-PRO!917. anti- 
PROl 868. ami-PRO205, anti-PR02h anti-PR0269, anti-PR0344. anti-PR0333. anti-PR038l. anti-PRO720. 
anri-PR0866. anu-PRO840. anti-PR0982. anti-PR0836. anti-PROl 159. anti-PR01358. anti-PR01325. anti- 
PR01338. anu-PROI434. anti-PR04333. ami-PRO4302. anu-PRO4430 or anti-PR05727 antibodies to inhibit 
the effect of the PRO200. PRO204. PR02I2. PR0216. PR0226. PRO240. PR0235. PR0245. PR0172. 
PR0273. PR0272. PR0332. PR0526. PRO70I. PR0361. PR0362. PR0363. PR0364. PR0356. PR053I. 
PR0533. PRO1083. PR0865, PRO770, PR0769. PR0788. PROI1I4. PRO1007, PROU84, PRO1031, 
PR01346. PR01155, PRO1250. PR01312. PROH92, PR01246. PROI283. PROl 195. PR01343, PR01418. 
PROI387. PROI4I0. PR01917, PR01868. PRO205. PR021. PR0269. PR0344. PR0333, PR0381. PRO720, 
PR0866, PRO840, PR0982, PR0836. PROI159. PROU58, PRO 1325, PR01338, PR01434, PR04333. 
PRO4302, PRO4430 or PR05727 polypeptides, respectively, on tissue ceils is tested. Exemplary antibodies 
include porycional monoclonal, humanized, bispecific. and heteroconjugaie antibodies, the preparation of 

which will be described hereinbelow. 

Antibody binding studies may be carried out in any known assay method, such as competitive binding 

assays, direct and indirect sandwich assays, and inimuooprecipiiation assays. Zola, Monoclonal Antibodies: A 

Manual of Techniques, pp. 147-158 (CRC Press, Inc., 1987). 

Competitive binding assays rely on the ability of a labeled standard to compete with the test sample 

anatyte for binding with a limited amount of antibody. The amount of target protein in the test sample is 

inversely proportional to the amount of standard that becomes bound to the antibodies. To feciHtare 
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deiermining the amount of standard that becomes bound, the antibodies preferably are insolubilized before or 
after the competition, so that the standard and analyte that are bound to the antibodies may conveniently be 
separated from the standard and analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different 
immunogenic portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte is 
bound by a fust antibody which is immobilized on a solid support, and thereafter a second antibody binds to 
the analyte. thus forming an insoluble three-part complex. See, e.g., US Pat No. 4,376,1 10. The second 
antibody may itself be labeled with a detectable moiety (direct sandwich 

assays) or may be measured using an antiimmunoglobulin antibody that is labeled with a detectable moiety 
(indirect sandwich assay). For example, one type of sandwich assay is an ELISA assay, in which case the 
detectable moiety is an enzyme. 

For mimunohiscochemistry. the tissue sample may be fresh or frozen or may be embedded in paraffin 
and fixed with a preservative such as formalin, for example. \ 

G. Cell-Based Assays 

Cell-based assays and animal models for immune related diseases can be used to further understand 
the relationship bcrween the genes and polypeptides identified herein and the development and pathogenesis of 
immune related disease. 

In a different approach, eclb of a eel! type known to be involved in a particular immune related 
disease are cransfected with the cDNAs described herein, and the ability of these cDNAs to stimulate or inhibit 
immune function is analyzed. Suitable cells can be transfected with the desired gene, and monitored for 
immune function activity. Such transfected cell lines can then be used to test the ability of poly- or monoclonal 
antibodies or antibody compositions to inhibit or stimulate immune function, for example to modulate T-cell 
proliferation or inflammatory cell infiltration. Cells transfected with the coding sequences of the genes 
identified herein can further be used to identify drug candidates for the treatment of immune related diseases. 

In addition, primary cultures derived from transgenic animals (as described below) can be used in the 
cell-based assays herein, although stable ceil lines are preferred. Techniques to derive continuous cell lines 
from transgenic animals are well known in the an (see, e.g.. Small er ai. Mol. Cell. Biol. 5: 642-648 [ 1 985]). 

One suitable ceil based assay is the mixed lymphocyte reaction (MLR). Current Protocols in 
Immunology-, unit 3.12; edited by J E Coligan, A M Kruisbeek, D H Marglics. E M Shevach. W Strober, 
National Institutes of Health. Published by John Wiley & Sons, Inc. In this assay, the ability of a test 
compound to s timu l at e or inhibit the proliferation of activated T cells is assayed. A suspension of responder T 
cells is cultured with allogeneic stimulator cells and the prouferation of T cells is measured by uptake of 
tritiated thymidine. This assay is a general measure of T cell reactivity. Since the majority of T cells respond 
to and produce EL- 2 upon activation, differences in responsiveness in this assay in part reflect differences in IL- 
2 production by the responding cells. The MLR results can be verified by a standard fymphokine (IL-2) 
detection assay. Current Protocols in immunology, above. 3. 15, 6.3. 

A proliferative T cell response in an MLR assay may be due to direct mitogenic properties of an 
assayed molecule or to external antigen induced activation. Additional verification of the T cell stimulatory 
activity of the PRO polypeptides can be obtained by a cosrimularion assay. T cell activation requires an 
antigen specific signal mediated through the T-cell receptor (TCR) and a cc^nmulatory signal mediated 
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through a second ligand binding interaction, for example, the B7 (CD80. CD86VCD28 binding interaction. 
CD28 crosslinking increases lyinphokine secretion by activated T cells. T ccU activation has both negative and 
positive controls through the binding of ligands which have a negative or positive effect CD28 and CTLA-4 
are related glycoproteins in the Ig superfamily which bind to B7. CD28 binding to B7 has a positive 
cosomukiion effect of T cell activation: conversely. CTLA-4 binding to B7 has a negative T cell deactivating 
effect Chambers. C. A. and Allison. J. P.. Curr. Opin. Immunol. (1997) 9:396. Schwartz. R. H.. Cell (1992) 
71:1065: Linsey, P. S. and Ledbetter. J. A. Anna. /fev. Immunol. (1993) M:19l; June, C. H. et at. Immunol. 
Today(\994) ]5:321: Jenkms. ^ K.. /mmxmw i na costimulation assay, the PRO polypeptides 

are assayed for T cell costimulatory or inhibitory activity. 

PRO polypeptides, as well as other compounds of the invention, which are stimulators (costixnulators) 
of T cell proliferation and agonists, e.g.. agonist antibodies, thereto as determined by MLR and costimulation 
assays, for example, are useful in treating immune related diseases characterized by poor, suboptimal or 
inadequate immune function. These diseases are treated by stimulating the prohfcration and activation of T 
cells (and T cell mediated immunity) and enhancing the immune response in a mammal through administration 
of a stimulatory compound, such as the stimulating PRO polypeptides. The stimulating polypeptide may. for 
example, be a PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235, PR0245. PR0172, 
PR0273. PR0272. PR0332. PR0526. PRO701. PR036I. PR0362. PR0363. PR0364. PR0356. PR0531, 
PR0533. PRO1083. PR0865. PRO770. PR0769. PRQ788. PRO U 14. PRO1007, PROH84, PROI03L 
PR01346. PROl 155. PRO1250. PR013 12. PROI 192, PR01246. PR01283, PROl 195, PROI343. PROI418, 
20 PROI387, PRO14I0, PR01917. PR01868. PRO205. PR02 1 . PR0269, PR0344. PR0333, PR0381. PRO720, 
PR0866. PRO840. PR0982, PR0836. PROI 159, PR01358. PR01325, PR01338, PR01434. PR04333, 
PRO4302. PRO4430 or PR05727 polypeptide or an agonist antibody thereof. 

Direct use of a stimulating compound as in me invention has been validated in experiments with 4- 
IBB glycoprotein, a member of the tumor necrosis (actor receptor mmily. which binds to a ligand (4-lBBL) 
expressed on primed T cells and signals T cell activation and growth. Alderson. M E. et mL J. Immunol. 
(1994) 24:2219. 

The use of an agonist stimulating compound has also been validated experimentally. Activation of 4- 
1BB by treatment with an agonist anti-4-lBB antibody enhances eradication of tumors. Hellstrom. I and 
Hellstrom, KL Cm. Rex, Immunol. (1998) 18: 1. Itmnunoadjuvant therapy for treatment of tumors, described 
30 in more detail below, is another example of the use of the stimulating compounds of the invention. 

An immune stimulating or enhancing effect can also be achieved by ^g™™- OT blocking the 
activity of a PRO which has been found to be inMnnng in the MLR assay. Negating the inhibitory activity of 
the compound produces a net stimulatory effect. Suitable anmgomsts/blocking compounds are antibodies or 
fragments thereof which recognize and bind to the inhibitory protein, thereby blocking the effective interaction 
35 of the protein with its receptor and inhibiting signaling through the receptor. This effect has been validated in 
experiments using anti-CTLA-4 antibodies which enhance T cell proliferation, presumably by removal of the 
inhibitory signal caused by CTIA^ binding. Walunas,T. L etai. Immunity ( 1994)1:405. 

Alternatively, an immune stimulating or enhancing effect can also be achieved by administration of a 
PRO which has vascular permeability enhancing properties. Enhanced vacuolar permeability would be 
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beneficial to disorders which can be attenuated by local infiltration of immune cells (e.g.. monocytes, 
eosinophils. PMNs) and inflammation. 

On the other hand. PRO polypeptides, as well as other compounds of the invention, which arc direct 
inhibitors of T cell proliferation/activation, lymphokine secretion, and/or vascular permeability can be directly 
5 used to suppress the immune response. These compounds are useful to reduce the degree of the immune 
response and to treat immune related diseases characterized by a hyperactive, superoptixnal. or autoimmune 
response. This use of the compounds of the invention has been validated by the experiments described above 
in which CTLA-4 binding to receptor B7 deactivates T cells. The direct inhibitory compounds of the invention 
function in an analogous manner. The use of compound which suppress vascular permeability would be 
10 expected to reduce inflammation. Such uses would be beneficial in treating conditions associated with 
excessive inflammation. 

Alternatively, compounds, e.g.. antibodies, which bind to stimulating PRO polypeptides and block the 
stimulating effect of these molecules produce a net inhibitory effect and can be used to suppress the T cell 
mediated immune response by inhibiting T cell proliferation/activation and/or lymphokine secretion. Blocking 
15 me stimulating effect of the polypeptides suppresses the immune response of the mammal. This use has been 
validated in experiments using an anti-lL2 antibody. In these experiments, the antibody binds to IL2 and 
blocks binding of IL2 to its receptor thereby achieving a T cell inhibitory effect. 

H. Animal Models 

The results of the cell based m vin-o assays can be further verified using in vivo animal models and 

20 assays for T-cell function. A variety of well known animal models can be used to further understand the role 
of the genes identified herein in the development and pathogenesis of immune related disease, and to test the 
efficacy of candidate therapeutic agents, including antibodies, and other antagonists of the native polypeptides, 
including small molecule antagonists. The in vivo nature of such models makes them predictive of responses 
in human patients. Animal models of immune related diseases include bom non-recombinant and recombinant 

25 (transgenic) animals. Non-recombinant animal models include, for example, rodent e.g., murine models. 
Such models can be generated by introducing cells into syngeneic mice using standard techniques, e.g., 
subcutaneous injection, tail vein injection, spleen implantation, intraperitoneal implantation, implantation 
under the renal capsule, etc. 

Graft-versus-host disease occurs when immunocompetent cells are transplanted into 

30 immunosuppressed or tolerant patients. The donor cells recognize and respond to host antigens. The response 
can vary from hTe threatening severe inflammation to mild cases of diarrhea and weight loss. Graft-versus-host 
disease models provide a means of assessing T cell reactivity against MHC antigens and minor transplant 
antigens. A suitable procedure is described in detail in Current Protocols in Immunology, above, unit 43. 

An animal model for skin allograft rejection is a means of testing the ability of T cells to mediate in 

35 vivo tissue destruction and a measure of their role in transplant rejection. The most common and accepted 
models use murine tail-skin grafts. Repeated exp erim e n ts have shown that skin allograft rejection is mediated 
by T cells, helper T cells and killer-effector T cells, and not antibodies. Auchmcloss, H. Jr. and Sachs, D. rL, 
Fundamental Immunology, 2nd ed., W. E. Paul ed.. Raven Press, NY, 1989, 889-992. A suitable procedure is 
described in detail in Current Protocols in Immunology, above, unit 4.4. Other transplant rejection models 
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which can be used to rest the compounds of the invention are the allogeneic heart transplant models described 
by Tanabe, M. et ai Transplantation (1994) 58:23 and Tinubu. S. A. et ai J. Immunol. ( 1994) 4330-4338. 

Animal models for delayed type hypersensitivity provides an assay of cell mediated immune function 
as well. Delayed type hypersensiaviry reactions are a T cell mediated in \ivo immune response characterized 
by inflammation which does not reach a peak until after a period of time has elapsed after challenge with an 
antigen. These reactions also occur in tissue specific autoimmune diseases such as multiple sclerosis (MS) and 
experimental autoimmune encephalomyelitis (EAE. a model for MS). A suitable procedure is described in 
detail in Current Protocols in Immunology, above, unit 4.5. 

EAE is a T cell mediated autoimmune disease characterized by T cell and mononuclear cell 
inflammation and subsequent demyelinanon of axons in the central nervous system. EAE is generally 
considered to be a relevant animal model for MS in humans. Bolton. C. Multiple Sclerosis (1995) ±:143. 
Both acute and relapsmg-remitting models have been developed The compounds of the invention can be 
tested for T cell stimulatory or inhibitory activity against immune mediated deroyelinating disease using the 
protocol described in Current Protocols in Immunology, above, units 15.1 and 15.2. See also the models for 
myelin disease tn which oligodenaiocytes or Schwann cells arc grafted into the central nervous system as 
described in Duncan. I. D. et ai Molcc Meet Today (1997) 554*561 

Contact hypersensitivity is a simple delayed type hypersensitivity in vivo assay of cell mediated 
immune function. In this procedure, cutaneous exposure to exogenous haptens which gives rise to a delayed 
type hypersensitivity reaction which is measured and quantitated. Contact sensitivity involves an initial 
sensitizing phase followed by an excitation phase. The elicharion phase occurs when the T lymphocytes 
encounter an antigen to which they have had previous contact Swelling and inflammation occur, making this 
an excellent model of human allergic contact dermatitis, A suitable procedure is described in detail in Current 
Protocols in Immunologic Eds. J. E Cologan, A. M. Kruisbeek, D. H. Margulies. E. M. Shevach and W. 
Strober. John Wiley & Sons. Inc.. 1994. unit 4.2. See also Grabbc. S. and Schwarz. T. Immun. Today ]9 (1): 
37-W(l998). 

An animal model for arthritis is collagen-induced arthritis. This model shares clinicaL histological 
and immunological characteristics of human autoimmune rheumatoid arthritis and is an acceptable model for 
human autoimmune arthritis. Mouse and rat models are characterized by synovitis, erosion of cartilage and 
subchondral bone. The compounds of the invention can be tested for activity against autoimmune arthritis 
using the protocols described in Current Protocols in Immunology, above, units 15.5. See also the model using 
a monoclonal antibody to CDI8 and VLA-4 integrals described in Issekutz. A.C et aL Immunology (1996) 
88:569. 

A model of asthma has been described in which antigen- induced airway hyper-reactivity, pulmonary 
eosinophil 811(1 inflammation are i n d uce d by sensitizing an animal with ovalbumin and then challenging the 
animal with the same protein delivered by aerosoL Several animal models (guinea pig, rat, non-human 
primate) show symptoms similar to atopic asthma in humans upon challenge with aerosol antigens. Murine 
models have many of the features of human asthma. Suitable procedures to test the compounds of the 
invention for activity and effectiveness in the treatment of asthma are described by Wolyniec W. Vf.etal.Am. 
J. Hespir. CeUMol Biol. (1998) ]8:777 and the references cited therein. 
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Additionally, the compounds of the invention can be tested on animal models for psoriasis like 
diseases. Evidence suggests a T cell pathogenesis for psoriasis. The compounds of the invention can be tested 
in the scid/scid mouse model described by Schon. M. P. et al. Nat. Med (1997) 3:183, in which the mice 
demonstrate histopathologic skin lesions resembling psoriasis. Another suitable model is the human skin/sad 
5 mouse chimera prepared as described by Nickoloft B. J. et ai. Am. J. Path. (1995) 146:580. 

Recombinant (transgenic) animal models can be engineered by introducing the coding portion of the 
genes identified herein into the genome of animals of interest, using standard techniques for producing 
transgenic animals. Animals thai can serve as a target for transgenic manipulation include, without limitation, 
mice. rats, rabbits, guinea pigs, sheep, goats, pigs, and non-human primates. e.g.. baboons, chimpanzees and 
10 monkeys. Techniques known in the art to introduce a transgene into such animals include pronuclei 
microinjection (Hoppe and Wanger, U.S. Patent No. 4,873,191); retrovirus-mediated gene transfer into germ 
lines (e.g.. Van derPutten etal.. Proc. Natl. Acad. ScL USA 82. 6148-615 [1985]); gene targeting in embryonic 
stem cells fThompson et aL Cell 56. 313-321 [1989]); electroporation of embryos (Lo. Mol. Cel. BioL 3, 
1803-1 814 (I983J); sperm-mediated gene transfer (Lavitrano et al.. Cell 57. 717-73 [1989]). For review, see. 
15 for example. U.S. Patent No. 4.736.866. 

For the purpose of the present invention, transgenic animals include those that carry the transgene 
only in pan of their cells ( "mosaic animals"). The transgene can be iniegrated either as a single transgene. or m 
concatamers. e.g.. head-to-head or hcad-to-tail tandems. Selective introduction of a transgene into a particular 
cell type is also possible by following, for example, the technique of Lasko et aL Proc. Natl. Acad Set. USA 
20 8 9.6232-636(1992). 

The expression of the transgene in transgenic animals can be monitored by standard techniques. For 
example. Southern blot analysis or PCR amplification can be used to verify the integration of the transgene. 
The level of mRNA expression can then be analyzed using techniques such as in situ hybridization. Northern 
blot analysis. PCR. or imrounocytochemistry. 
2 5 The animals may be further examined for signs of immune disease pathology, for example by 

histological examination to determine infiltration of immune cells into specific tissues. Blocking experiments 
can also be performed m which the transgenic animals are treated with the compounds of the invention to 
determine the extent of the T cell proliferation stimulation or inhibition of the compounds. In these 
experiments, blocking antibodies which bind to the PRO polypeptide, prepared as described above, are 
30 administered to the animal and the effect on immune function is determined. 

Alternatively, "knock out" animals can be constructed which have a defective or altered gene 
encoding a polypeptide identified herein, as a result of homologous recombination between the endogenous 
gene encoding the polypeptide and altered genomic DNA encoding the same polypeptide introduced into an 
embryonic cell of die animaL For example, cDNA encoding a particular polypeptide can be used to clone 
35 genomic DNA encoding that polypeptide in accordance with established techniques. A portion of the genomic 
DNA encoding a particular polypeptide can be deleted or replaced with another gene, such as a gene encoding 
a selectable marker which can be used to monitor integration. Typically, several kilobascs of unaltered 
Hanking DNA (both at the 5» and 3' ends) are included in the vector [see e.g., Thomas and Capecchi, Cell, 
SU5Q3 (1987) for a description of homologous recombination vectors]. The vector is introduced into an 
40 embryonic stem cell line (e.g„ by electroporation) and cells in which the introduced DNA has homologously 
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recombmed with the endogenous DNA are selected |see e.g.. U « aL Cell. 69:915 f!992)). The selected cells 
are then injected into a blastocyst of an animal (e.g.. a mouse or rat) to form aggregation chimeras (see eg 
Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach. E J. Robertson, ed. (RL 
Oxford, 1987). pp. , 13-152). A chimeric embryo can men be implanted into a suitable pseudoprcgnanr female 
5 fosmr animal and the embryo brought to term to craate a "knock out" animal Progeny harbonng the 
homologous* recombined DNA in their germ cells can be identified by standard techniques and used to breed 
annuals in which all cells of the animal conuun the homologous.y recombined DNA. Knockout animals can be 
characters for instance, for their ability to defend against certain paralogical conditions and for their 
development of pathological conditions due to absence of the polypeptide. 
'0 I- ImmunoAdiuvant Therapy 

In one embodiment the immunosumuiaung compounds of the invention can be used in 
unmunoadjuvan, therapy for the treatment of tumors (cancer). It is now well established that T cells recognize 
human tumor specific anngens. One group of tumor antigens, encoded by the MAGE BACE and GAGE 
famdies of genes, are silent in all adul, normal ..sues . bu, are pressed in significant amounts in rumors 
such as melanomas, lung rumors, head and neck tumors, and bladder carcinomas. DeSmet. C. ,/ at 1 1996) 
Proc. Kail. Acad. Sci. USA. 93:7,49. 1, has been shown ma, cosumulauon of T cells induces tumor recession 
and an anmumor response bom in ,i,ro and in Melero. I. et at.. Nature Medicine ( 1 997) 3:682: ICwon E 
D. e, a,.. Proc Nad. Acad Sci. US4 (1997) 94: 8099: Lynch. D. H. et at. Nature Medicine ( 19~97) 3:625: Fmn. 
O. J. and Loue. M. T„ J. Immunol. (.998) 21:1 .4. The stimulatory compounds of the invention can be 
admmrstered as adjuvants, alone or .ogcher with a grow* regulating agent cytotoxic agent or 
chemotherapeuuc agent, to stimulate T cell pmliferation/activation and an antitumor response to tumor 
anngens. The growth regulating, cymtoxic. or chemotherapeutic agent may be adminmered in conventional 
amoums usmg known administration regimes. Immunostimnlating activity by the compounds of the invention 
allows reduced amounts of the growth regulating, cytotoxic, or chemmherapeutic agents thereby potentially 
25 lowering the toxicity to the patient. 

J - Screening Assavs for Drug Candidates 

Scraening assays for drug cand.da.es are des,gned u, identify compounds that bind to or complex with 
me polypeptides encoded by the genes identified herem or a biologically active fragment thereof, or otherwise 
mterfere «,th the in.eract.on of the encoded polypeptides with other cellular proteins. Such screening assays 
w.ll include assays amenable to high-throughput screening of chemical libraries, making mem particularly 
suaaWe for identifying small molecule drag candidates. Small molecules contemplated include synthetic 
organ* or inorganic compounds, including peptides, preferably soluble peptides. (po.y)peptid.- 
unmtmoglobulin fusions, and. in particular, antibodies including, wuhout .mutation, poly- and monoclonal 
anybodies and antibody fragments, single-chain antibodies, anti-idiorypic antibodies, and chimeric or 
humamzed vers.ons of such antibodies or fragments, as well as human antibodies and antibody fragments. The 
assays can be performed in a variety of formats, including protein-protein binding assays, biochemical 
scmenmg assays, immunoassays and cell based assays, which are well characmrized in the an. 

All assays are common in mat they call for contacting the drug candidate wim a polypeptide encoded 
by a nucleic acid identified herein under conditions and for a time sufficient to allow these two component, to 
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In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture, in a particular embodiment, the polypeptide encoded by the gene identified herein or the 
drug candidate is immobilized on a solid phase, e.g., on a microliter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution 
of the polypeptide and drying. Alternatively, an immobilized antibody, e.g.. a monoclonal antibody, specific 
for the polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed by 
adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized 
component. e.g.. the coated surface containing the anchored component When the reaction is complete, the 
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are 
detected. When the originally non-immobilized component carries a detectable label, the detection of label 
immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labelled antibody 
specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind 10 a particular protein encoded by a gene 
identified herein, us interaction with that protein can be assayed by methods well known for detecting protein- 
protein interactions. Such assays include traditional approaches, such as. cross-linking, co- 
immunoprecipiuuion. and co-purification through gradients or chromatographic columns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by Fields and co- 
workers [Fields and Song. Nature (ton Jon) 340. 245-246 (1989): Chien el a/.. Proc. NatL Acad ScL USA 88. 
9578-9582 (1991)) as disclosed by Chevray and Nathans. Prnc Nail Acad. ScL USA 89. 5789-5793 (1991). 
Many transcriptional activators, such as yeast GAL4. consist of two physically discrete modular domains, one 
acting as the DNA-binding domain, while the other one functioning as the transcription activation domain. 
The yeast expression system described in the foregoing publications (generally referred to as the "two-hybrid 
system") takes advantage of this properry. and employs two hybrid proteins, one in which the target protein is 
fused to the DNA-binding domain of GAL4. and another, in which candidate activating proteins are fused to 
the activation domain. The expression of a GALUtacZ reporter gene under control of a GAL4-activated 
promoter depends on reconstiruuon of GAL4 activity via protein-protein interaction. Colonies containing 
interacting polypeptides are detected with a chromogenic substrate for (3-galactosidase. A complete kit 
(MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be extended to map protein 
toroams involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

In order to find compounds that interfere with the interaction of a gene identified herein and other 
intra- or extracellular components can be tested, a reaction mixture is usually prepared containing the product 
of the gene and the intra- or extracellular component under conditions and for a time allowing for the 
interaction and binding of the two products. To test the ability of a test compound to inhibit binding, the 
reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be added to 
a third reaction mixture, to serve as positive control. The binding (complex formation) between the test 
compound and the intra- or extracellular component present in the mixture is monitored as described above. 
The formation of a complex in the control reaction(s) but not in the reaction mixture containing the test 
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compound indicates mat the test compound interferes with the interaction of the test compound and its reaction 
partner. 

Compositions and Metho ds for the Treatment of Immune Related Diseases 
The compositions useful in the treatment of immune related diseases include, without limiation. 
proteins, antibodies, small organic molecules, peptides, phosphopepudes. antisense and nbozymc molecoks. 
triple helix molecules, etc. that inhibit or stimulate immune function, for example. T cell 
proliferation/activation, lymphokine release, or immune cell infiltration. 

For example, antisense RNA and RNA molecules act to direedy block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. When annsense DNA is used, 
oligodeoxwbonueleotides derived from the translation initiation site. eg., between about -10 and +10 
positions of the target gene nucleotide sequence, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymes act by sequence-specific hybridization to the complementary target RNA. followed by 
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by 
known techniques. For further details see. e.g.. Rossi. Current Biology 4. 469-471 (1904). and PCT 
publicauon No. WO 97,-33551 (published September 18. 1997). 

Nucleic acid molecules in triple helix formation used to inhibit transcription should be single-stranded 
and composed of deoxynucleottdes. The base composition of these oligonucleotides is designed such that it 
promotes triple helix formation via Hoogsteen base pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one soand of a duplex. For further details see. e.g.. PCT publication No. WO 
97/3355 1 . supra. 

These molecules can be identified by any or any combination of the screening assays discussed above 
and/or by any other screening techniques well known for those skilled in the an. 
L. Antibodies 

The present invention farther provides anti-PRO antibodies and fragments thereof which mav inhibit 
tamagomsts, or stimulate .agorusts. T cell proliferation, eosinophil infiltration, vascular permeabilitv. etc 
Such ant.-PRO antibodies or fragments thereof include polyclonal, monoclonal humanized, bispecific and 
heteroconjugate antibodies. 

I- Polyclonal Antibodies 
The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled anisan. Polyclonal antibodies can be raised m a mammal for example, by 
one or more injections of an immunizing agent and. if desired, an adjuvant. Typically, die immunizing agent 
and/or adjuvant will be injected in die mammal by multiple subcutaneous or mtraperitoneal injections. The 
immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate 
the immunizing agent to a protem known to be tmmunogenic in the mammal being immunized. Examples of 
such immunogenic proteins include but are not limned to keyhole limpet hemocyanin. serum albumin, bovine 
thyroglobulm. and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freaks 
complete adjuvant and MPL-TDM adjuvant (monophosphoryi Lipid A. synthetic trehalose dicorynomycolate). 
The tmnnntization protocol may be selected by one alti^ 
40 2. Monoclonal Antibodies 
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The anu-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may 
be prepared using hybridoma methods, such as those described by Jtohler and Milstetn. Nature, 256:495 
(1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal is typically immimtT^ 
with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will 
specifically bind to the immunizing agent Alternatively, the lymphocytes may be imm u n ized in vitro. 

The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof: 
Generally, either peripheral blood lymphocytes ("PBLs B ) are used if cells of human origin are desired, or 
spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are 
then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell {Goding. Monoclonal Antibodies: Principles and Practice, Academic Press. (1986) pp. 59- 
103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, 
bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may 
be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth 
or survival of the untused. immortalized cells. For example, if (he parental cells lack the enzyme hypoxanthine 
guanine phosphonbosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will 
include hypoxanthine. ammopienn. and thymidine CHAT medium"), which substances prevent the growth of 
HGPRT-dcficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk 
institute Cell Distribution Center. San Diego, California and the American Type Culture Collection. Manassas. 
Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies [Kozbor. J. Immunol.. 133:3001 (1984); Brodeur et a/.. 
Monoclonal Antibody Production Techniques and Applications, Marcel Dekkcr. Inc.. New York. (1987) pp. 
51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of 
monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipttation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are 
known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysts of Munson and Pollard, Anal Biochem.. 107:220 ( 1980). 

After the desired hybridoma ceils are identified, the clones may be subdoned by limiting dilution 
procedures and grown by standard methods [Goding, supra]. Suitable culture media for this purpose include, 
for example, Dulbecco's Modified Eagle's Medium and RPMi-1640 medium Alternatively, the hybridoma 
cells may be grown in vivo as ascites in a mammal 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 
m e d i um or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein 
A-Sepharose, hydroxyapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described 
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily 
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isolated and sequenced using conventional procedures (e.g.. by using oligonucleotide probes thai are capable of 
binding specifically to genes men d ing the heavy and light chains of murine antibodies). The hybridoma ceils 
of the invention serve as a preferred source of such DNA. Once isolated the DNA may be placed into 
expression vectors, which are then transfected into host cells such as simian COS cells. Chinese hamster ovary 
(CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis 
of monoclonal antibodies in the recombinant host cells. The DNA also may be modified, for example, by 
substituting the coding sequence for human heavy and light chain constant domains in place of the homologous 
murine sequences [U.S. Patent No. 4,816.567; Morrison et al.. supra) or by covalently joining to me 
immunoglobulin coding sequence ail or pan of the coding sequence for a non-immunoglobulin polypeptide. 
Such a non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of the 
invention: or can be substituted for the variable domains of one aimgen-combming site of an antibody of the 
invention to create a chimeric bivalent antibody. 

The antibodies are preferably monovalent antibodies. Methods for preparing monovalent antibodies 
are well known in the an. For example, one method involves recombinant expression of rnimunoglobulin light 
chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to 
prevent heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another 
amino acid residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly. Fab fragments, can be accomplished using routine techniques known 
20 in the art. 

3. Human and Humanized Antibodies 
The anti-PRO antibodies of the invention may runner comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g.. murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv. Fab. Fab'. F(ab')2 or other antigen- binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species fdonor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
capacity. In some instances. Fv framework residues of the human rnimunoglobulin are replaced by 
corresponding non-human residues. Humanized antibodies may also comprise residues which are found 
neither in the recipient antibody nor in the imported CDR or framework sequences, to general the humanized 
antibody will comprise substantially all of at least one. and typically two. variable domains, in which all or 
substantially aU of the CDR regions correspond to those of a non-human immunogiobiiiin and all or 
substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The humanized 
antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc). typically 
mat of a human immunoglobulin [Jones et a!., Mature, 321:522-525 (1986); Riechmann et a!.. Nature, 
332:323-329 (1988); and Presta, Curr. Op. Struct. BioL 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies arc well known in the art Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-mmum These 
non-human ammo acid residues are often referred to as "import" residues, which are typically taken from an 
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M impon M variable domain. Humanization can be essentially performed following the method of Winter and 
coworkers [Jones et aL Nature. 32^522-525 (1986): Riechmann et aL Nature. 332:323-327 (1988); 
Vernoeyen et a!.. Science, 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric 
antibodies (U.S. Patent No. 4.816.567), wherein substantially less than an intact human variable domain has 
been substituted by the corresponding sequence from a non-human species. In practice, humanized antibodies 
are typically human antibodies in which some CDR residues and possibly some FR residues are substituted by 
residues from analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art. including phage 
display libraries' [Hoogenboom and Winter. /. Mol Biol.. 227:381 (1991); Marks et aL J. Mol. Biol. 222:581 
(1991)]. The techniques of Cole et aL and Boerner et aL are also available for the preparation of human 
monoclonal antibodies (Cole et aL Monoclonal Antibodies and Cancer Therapy. Alan R. Liss. p. 77 (1985); 
Boerner et aL J. Immunol. |47(l):86-95 (1991); U.S. 5J50, 373]. Similarly, human antibodies can be made 
by introducing of human immunoglobulin loci into transgenic animals, e.g.. mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody 
production is observed, which closely resembles that seen in humans in all respects, including gene 
rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent 
Nos. 5.545.807: 5.545.806; 5.569.825: 5.625,126; 5,633.425: 5.661.016. and in the following scientific 
publications: Marks et aL BioTTechnohgy K). 779-783 (1992): Lonberg et a!.. Nature 368: 856-859 (1994): 
Morrison. Nature 368. 812-13 (1994): Fishwild et al, Nature Biotechnology- W. 845-51 (1996); Neubcrger, 
Nature Biotechnology U. 826 (1996): Lonberg and Huszar. Intern. Rev. Immunol 23 65-93 (1995). 

The antibodies may also be affinity matured using known selection and/or mutagenesis methods as 
described above. Preferred affinity matured antibodies have an affinity which is five times, more preferably 10 
times, even more preferably 20 or 30 times greater than the starting antibody (generally murine, humanized or 
human) from which the matured antibody is prepared. 

4. Bispecific Antibodies 

Bispecific antibodies arc monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities may be for 
the PRO. the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 
subunit 

Methods for making bispecific antibodies are known in the art Traditionally, the recombinant 
production of bispecific antibodies is based on the coexpression of two immunoglobulin heavy-chain/ltght- 
chain pairs, where the two heavy chains have different specificities (MUstein and Cuello. Nature. 305:537-539 
[1983]). Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas 
(quadroraas) produce a potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 
chreniatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in 
Traunecker et aL EMBO J. 1 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) 
can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an mmumogiobuiin 
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heavy-chain consent domain, comprising at least pan of the hinge. CHZ and CH3 regions, it is preferred to 
have the first heavy-chain constant region (CHI) comaimng the site necessary for hghi-chain binding present 
in at least one of the fusions. ONAs encoding the immunoglobulin heavy-chain fusions and. if desired, the 
tmmunoelobulin light chain, are inserted into separate expression vectors, and are cotransfected into a suitable 
host organism. For runner details of generating bispecific antibodies see. for example, Suresh er aL t Methods 
in Enzymohgy. _m :2 1 0 fl 986). 

According to another approach described in WO 9672701 1, the interlace between a pair of antibody 
molecules can be engineered to maximize the percentage of heterodimers which are recovered from 
reconibinam cell culture. Trie preferred interface cwnprises at least a pan of the CH3 region of an antibody 
constant domain. In this method, one or more small amino acid side chains from the interface of the first 
antibody molecule are replaced with larger side chains (e.g.. tyrosine or tryptophan). Cotrrpensatory "cavities" 
of identical or similar size to the large side chains) are created on the interlace of the second anubody 
molecule by replacing large amino acid side chains with smaller ones (e.g.. alanine or rhreortine). This 
provides a mechanism for increasing ihe yield of the heterodimer over other unwanted end-products such as 
1 5 ho mod liners. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g.. F(ab h 
bispectfie antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical 
linkage. Brennan « at. Science 229:8! (1985) describe a procedure wherein intact antibodies am 
proteolyucally cleaved to generate Ffrb'h fragments. These fragments are reduced in the presence of the 
dithioi completing agent sodium arsenite to stabilize vicinal dithiols and prevent intennolecular disulfide 
formation. The Fab* fragments generated are then convened to duonitrobenzoate (TNB) derivatives. One of 
the Fab -TNB derivatives is then reconvened to the Fab-thiol by reduction with meroaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB denvative to form the bispecific antibody. The 
25 bispecific antibodies produced can be used as agents for the selective immobilization of enzymes. 

Fab" fragments may be directly recovered from E. coli and chemically coupled to form bispecific 
antibodies. Shalaby « at. J. Exp. U*<1. 175:217-225 (1992) describe the production of a folly humanized 
bispecific antibody F(ab b molecule. Each Fab" fragment was separately secreted from E. coli and subjected 
to directed chemical coupling in vim to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexposing the EibB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targes. 

Various technique for making and isolating bispecific antibody fragments directly from recombinant 
cell culture have also been described. For example, bispecific antibodies have been produced using leucine 
zippers. Kostelny er at. J. Immunol. 148 (5):1547-15S3 (1992). The leucine zipper peptides from the Fos and 
Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody 
bomodiaeni were reduced at the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimets. This method can also be utilized for the production of antibody homodimera. The "diabody" 
technology described by HoUmger « at, Proc Nad. Acad. So. USA 90:6444-6448 (1993) has provided an 
alternative mechanism for making bispectfie antibody fragments. The fragments comprise a heavy-chain 
variable domain (V„) connected to a light-chain variable domain (VJ by a linker which is too short to allow 
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pairing between the two domains on the same chain. Accordingly, the V„ and V t domains of one fragment 
are forced to pair with the complementary V L and V„ domains of another fragment, thereby forming two 
antigen-binding sites, Another strategy for making bispecific antibody fragments by the use of single-chain Fv 
(sFv) dimers has also been reported. See. Gruber et aL J. Immunol. ]52:5368 (1994). 

Antibodies with more than two valencies are contemplated For example, trispecific antibodies can be 
prepared. Tun et ai. . J. Immunol. 147:60 ( 1 99 1 ). 

Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypepide 
herein. Alternatively, an anri-PRO arm may be combined with an arm which binds to a triggering molecule on 
a leukocyte such as a T-cell receptor molecule (e.g. t CDZ CD3. CD28. or B7), or Fc receptors for IgG (FcyR), 
such as FcyRI (CD64). FcyRII (CD32) and FcyRm (CD16) so as to focus cellular defense nwhamsms to the 
cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize cytotoxic 
agents to cells which express a particular PRO polypeptide. These anybodies possess a PRO polypeptide - 
binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPT A. 
DOTA. or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue 
15 factor (TF) 

5. HetcrocomuBatc Antibodies 

Heteroconjugate antibodies arc composed of two covalently joined antibodies. Such antibodies have, 
for example, been proposed to target immune system cells to unwanted cells [US. Patent No. 4.676.980], and 
for treatment of HIV infection [WO 91/00360; WO 92/200373: EP 03089). It is contemplated that the 
20 antibodies may be prepared in vitro using known methods in synthetic protein chemistry, including those 
involving crowunking agents. For example, immunotoxins may be constructed using a disulfide exchange 
reaction or by forming a thioether bond Examples of suitable reagents for this purpose include iminothiolate 
and memyW-mercaptobutynmidate and those disclosed, for example, in U.S. Patent No. 4.676.980. 

6. Effector function engineering 

25 It may be desirable to modify the antibody of the invention with respect to effector function, so as to 

enhance the effectiveness of the antibody in treating an immune related disease, for example. For example 
cysteine residues) may be introduced in the Fc region, thereby allowing interchain disulfide bond formation in 
this region. The homodimehc antibody thus generated may have improved internalization capability and/or 
increased complement-mediated cell killing and antibody-dependent cellular cytotoxicity ( ADCO. See Caron 
*/ al.. J. Exp Med. 176:1191.1195 (1992) and Shopes, B. J. Immunol. 148:2918-2922 (1992). Homodimeric 
antibodies with enh an ced anti-tumor activity may also be prepared using heteromnmctimud cross-linkers as 
described in Wolff et ai. Cancer Research 53:2560-2565 (1993). Alternatively, an antibody can be engineered 
which has dual Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See 
Stevenson et ai.. Anti-Cancer Drug Design 2:219-230 (1989). 
35 7. Immunocontugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic 
agent such as a chemotberapeutic agent, toxin (e.g.. an cnzymatically active toxin of bacterial, fungal plant or 
animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemomerapeutic agents useful in the generation of such immunoconjugates have been described 
above. Enzymancally active toxins and fragments thereof which can be used include diphtheria A chain, 
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nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
chain, abrin A chain, modeccin A chain, alpha-sarcin. Aleurites fordii proteins, diaxnhxn proteins. Phytolaca 
americana proteins (PAPL PAPIL and PAP-S), momordica charantia inhibitor, curcin, crotm. sapaonaria 
officinalis inhibitor, gelonin. mitogellin. rearictochx phenomycin. enomycin and the tricotheceues. A variety 
of radionuclides are available for the production of radioconjugated antibodies Examples include a,2 Bu ,Jl r 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifuncuonai protein 
coupling agents such as N-^cciiumidyl-3K2-r^dyla^thion propionate (STOP), umnothiolane (IT), 
birnncrional derivatives of imidoesters (such as dimethyl adiphnidate HCL). active esters (such as 
disuccinirnidyl subcraie). aldehydes (such as ghitaraidehyde). bis-azido cornpounds (such as bis (p- 
azidobenzoyl) hexanediamine). bis-diazonium derivatives (such as bis^liazoniumbenzoyl). 
ethylenediamme). diisocyanates (such as tolyene 2.6^iisocyanate). and bis-active fluorine compounds (such as 
l4Hhfluoro-2,4^mitrobcnzene). For example, a ricin immunotoxin can be prepared as described in Vitetta et 
aL . Science 238: 1098 (1987). Carbon- 1 4-labeled NisotmocyanatobetizylO-m^ 
tnarmnepcntaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of radionuclide to me 
antibody. See WO94/11026. 

In another embodiment, the antibody may be conjugated to a -receptor" (such streptavidin) for 
utilization in tissue pretargeting wherein the antibody-receptor conjugate is administered to the patient 
followed by removal of unbound conjugate from the circulation using a clearing agent and then administration 
of a -licand" (e.g.. avidin) which is conjugated to a cytotoxic agent {e.g.. a ladwrtucleotide). 
8. Itrununolrposomes 
The proteins, antibodies, etc disclosed herein may also be formulated as immunoirposomcs. 
Liposomes containing the antibody are prepared by methods known in the art. such as described in Epstein et 
al.. Proc Nail. Acad Scu USA.JliHU (1985): Hwang a a!,. Proc. Natl Acad. Sci USA. 77:4030 (1980); 
25 and U.S. Pat Nos. 4.485.045 and 4.544.545. Liposomes with enhanced circulation time are disclosed in U.S. 
Patent No. 5.013.556. 

Particularly useful liposomes can be generated by the reverse phase evaporation method with a lipid 
composition cornprising phosphatidylcholine, cholesterol and PEG-derivatized phosphatidylcthanolarnme 
(PEG-PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired 
diameter. Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as 
described in Martin et at..J. Biol Oiem. 257: 286-288 (1982) via a disulfide interchange reaction. A 
chemotherapeutic agent (such as doxorubicin) may be optionally contained within the liposome. See Gabizon 
et a/., J. National Cancer Inst. 8£ ( 19) 1484 ( 1 989). 
M. Pharmaceutical Compositions 

The active PRO molecules of tbe invention {e.g., PRO polypeptides. anti-PRO antibodies, and/or 
vaianisof each) as well as other molecules identified by the screening assays disclosed above, can be 
adnmisteredfor the treatment of immune related diseases, in the form of pharmaceutical compositions. 

Therapeutic formulations of the active PRO molecule, preferably a polypeptide or antibody of the 
tavention. are prepared for storage by mixing the active molecule having the desired degree of purity with 
optional pharmaceutical^ acceptable carriers, excipiems or stabilizers (Remington's Pharmaceutical Sciences 
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16th edition. OsoL A. Ed. [1980]), in the fonn of lyophilized formulations or aqueous solutions. Acceptable 
earners, exctpients. or stabilizers are nontoxic to recipients at the dosages and concentrations employed, and 
include buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid and 
methionine; preservatives (such as octadecyidimethylbenzyl ammonium chloride: hexamethonium chloride; 
bcnzalkonium chloride, benzethonium chloride; phenol buryl or benzyl alcohol; alkyl parabens such as methyl 
or propyl paraben: catechol; rcsorcinol: cyclohexanol; 3-pemanol: and m-cresol); low molecular weight (less 
than about 10 residues) polypeptides: proteins, such as serum albumin, gelatin, or uronunogbbulins: 
hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, 
hisridinc. arginine. or lysine; monosaccharides, disaccharides. and other carbohydrates including glucose, 
rnannose. or dextrins: chelating agents such as EDTA; sugars such as sucrose, manniiol. trehalose or sorbitol; 
salt-forming counter-ions such as sodium: metal complexes (e.g., Zn-protein complexes): and/or non-ionic 
surfectams such as TWEEN™, PLURONICS™ or polyethylene glycol (PEC). 

Compounds identified by the screening assays disclosed herein can be formulated in an analogous 
manner, using standard techniques well known in the art. 

Lipofecuons or liposomes can also be used to deliver the PRO molecule into cells. Where antibody 
fragments arc used, the smallest inhibitory fragment which specifically binds to the binding domain of the 
target protein is preferred. For example, based upon the variable region sequences of an antibody, peptide 
molecules can be designed which retain the ability to bind the target protein sequence. Such peptides can be 
synthesized chemically and/or produced by recombinant DNA technology (sec e.g.. Marasco ci aL Proa Natl. 
Acad Set. USA 90. 7889-7893 [1993]). 

The formulation herein may also contain more than one active compound as necessary for the 
particular indication being treated, preferably those with complementary activities that do not adversely affect 
each other. Alternatively, or in addition, the composition may comprise a cytotoxic agent, cytokine or growth 
inhibitory agent. Such molecules are suitably present in combination in amounts that are effective for the 
purpose intended. 

The active PRO molecules may also be entrapped in microcapsules prepared, for example, by 
coacervahon techniques or by interfecial polymerization, for example, hydroxymethylceilulose or gelatin- 
microcapsules and poly-(methylmethacylaie) microcapsules, respectively, in colloidal drug delivery systems 
(for example. liposomes, albumin microspheres, microemulsions. nano-paiticles and nanocapsules) or in 
macroemulsions. Such techniques are disclosed in Remington's Pharmaceutical Sciences 16th edition. Osol, 
A_ Ed. (1980). 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished 
by filtration through sterile filtration membranes. 

Sustained-release preparations or the PRO molecules may be prepared. Suitable examples of 
sustained-release preparations include semipermeable matrices of solid hydrophobic polymers containing the 
antibody, which matrices are in the form of shaped articles, e.g.. films, or microcapsules. Examples of 
suaained-relcase matrices include polyesters, hydrogels (for example, poly(2-hydroxyemyl-metruicr^ or 
poWvinylalcohoI)), polylactides (U.S. PaL No. 3,773,919), copolymers of L-glutamic acid and y-emyl-L- 
glmamate, non-degradable ethylene-vinyl acetate, degradabie lactic acid-ghycolic acid copolymers such as the 
LUPRON DEPOT™ (injectable microspheres composed of lactic acid-irlycoiic acid copolymer and leuprolide 



78 



SUBSTITUTE SHEET (RULE 26) 



(C) 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/US0O/D584I 



20 



acetate), and poIy-D-(-)-3-hydroxybutyric acid. Whfle polymers such as ethylene- vinyl acetate and lactic acid- 
glycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins for shorter time 
periods. When encapsulated antibodies remain in the body for a long time, they may denature or aggregate as a 
result of exposure to moisture at 3T>C, resulting in a loss of biological activity and possible changes m 
5 inununogemciry. Rational strategies can be devised for stabilization depending on the mechanism involved. 
For example, if the aggregation mechanism is discovered to be imermolecular S-S bond formation through 
tmoHiisulrtde interchange, stabilization may be achieved by modifying suifbydryl residues, lyophilizing from 
acidic solutions, controlling moisture content, using appropriate additives, and developing specific polymer 
matrix compositions. 

10 N. Methods of Treatment 

It is contemplated that the polypeptides, antibodies and other active cenroounds of the present 
invention may be used to treat various immune related diseases and conditions, such as T cell mediated 
diseases, including those characterized by infiltration of inflammatory cells into a tissue, stimulation of T-cell 
proliferation, inhibition of T-cell proliferation, increased or decreased vascular permeability or the inhibition 

1 5 thereof. 

Exemplary conditions or disorders to be treated with the polypeptides, antibodies and other 
conmounds of the invention, include, but are not limited to systemic lupus erythematosis. rheumatoid anhritis. 
juvenile chronic arthritis, osteoarthritis, spondyloarmropathies. systemic sclerosis (scleroderma), idiopathic 
inflammatory myopathies (dermatorayositis. polymyositis). Sjogren's syndrome, systemic vasculitis, 
sarcoidosis, autoimmune hemolytic anemia (immune pancytopenia, paroxysmal nocturnal hemoglobinuria), 
autoimmune thrombocytopenia (idiopathic thrombocytopenic purpura, immune-mediated thrombocytopenia), 
thyroiditis (Grave's disease. Hashimoto's thyroiditis, juvenile lymphocytic thyroiditis, atrophic myrmditisK 
diabetes meilirus. inrnmrie-mediated renal disease (glomerulonephritis, tubulointemitiai nephritis), 
demyelinating diseases of the central and peripheral nervous systems such as multiple sclerosis, idiopathic 
25 demyelinating polyneuropathy or Gtiillain-Barre syndrome, and chronic mflarnrnatory demyelinating 
polyneuropathy, hepatobiliary diseases such as infectious hepatitis (hepatitis A. B. C. D. E and other non- 
hepatotropic viruses), aumimmune chronic active hepatitis, primary biliary cirrhosis, granulomatous hepatitis, 
and sclerosing cholangitis, inflammatory bowel disease (ulcerative colitis: Crohn's disease), gluten-sensitive 
enteropathy, and Whipple's disease, auiormmune or immune-mediated skin diseases including bullous skin 
diseases, erythema multiforme and contact dermatitis, psoriasis, allergic diseases such as asthma, allergic 
retinitis, atopic dermatitis, food hypersensitivity and urticaria, immunologic diseases of the lung such as 
eosinophilic pneumonias, idiopathic pulmonary fibrosis and hypersensitivity pneumonitis, transplantation 
associated diseases including graft rejection and graft -versus-host-disease. 

In systemic lupus erythematosus, the central mediator of disease is the production of auto-reactive 
antibodies to self proteins/tissues and the subsequent generation of mimune-mediated inflammation, antibodies 
either direcriy or mdirectly mediate tissue injury. Though T lymphocytes have not been shown to be directly 
mvoived in tissue damage, T lymphocytes are retired for the development of auto-reactive antibodies. The 
genesis of the disease is thus T lymphocyte dependent. Multiple organs and systems are affected clinically 
mduding kidney, hang, nrusailoskcletal system, mucc«utaneous, eye, central nervous system, cardiovascular 
system, gastromtestinal tract, bone marrow and blood. 
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Rheumatoid arthritis (RA) is a chronic systemic autoimmune inflammatory disease that mainly 
involves the synovial membrane of multiple joints with resultant injury to the articular cartilage. The 
pathogenesis is T lymphocyte dependent and is associated with the production of rheumatoid factors, auto- 
antibodies directed against self IgG. with the resultant formation of immune complexes that attain high levels 
in joint fluid and blood These complexes in the joint may induce the marked infiltrate of lymphocytes and 
monocytes into the synovium and subsequent marked synovial changes; the joint space/fluid if infiltrated by 
similar cells with the addition of numerous neutrophils. Tissues afTected are primarily the joints, often in 
symmetrical pattern. However, extra-articular disease also occurs in two major forms. One form is the 
development of extra-articular lesions wich ongoing progressive joint disease and typical lesions of pulmonary 
fibrosis, vasculitis, and cutaneous ulcers. The second form of extra-articular disease is the so called Felly's 
syndrome which occurs late in the RA disease course, sometimes after joint disease has become quiescent, and 
involves the presence of neutropenia, thrombocytopenia and splenomegaly. This can be accompanied by 
vasculitis in multiple organs wiih formations of infarcts, skin ulcers and gangrene. Patients often also develop 
rheumatoid nodules in the subcutis tissue overlying affected joints: the nodules late stage have necrotic centers 
surrounded by a mixed mflammaiory cell infiltrate. Other rnaniiestations which can occur in RA include: 
pericarditis, pleurilis. coronary arteritis, imestnial pneumonitis with pulmonary fibrosis, keratoconjunctivitis 
sicca, and rhematoid nodules. 

Juvenile chronic arthritis is a chronic idiopathic inllammatory disease which begins often at less than 
16 years of age. Its phenorypc has some similarities to RA; some patients which are rhematoid factor posiuve 
are classified as juvenile rheumatoid arthritis. The disease is sub-classified into three major categories: 
pauciarricular. polyarticular, and systemic. The arthritis can be severe and is typically destructive and leads to 
joint ankylosis and retarded growth. Other manifestations can include chronic anterior uveitis and systemic 
amyloidosis. 

Spondyloarthropathies are a group of disorders with some common clinical features and the common 
association with the expression of HLA-B27 gene product The disorders include: ankylosing sponylitis. 
Rater's syndrome (reactive anhntis). arthritis associated with infiaramatory bowel disease, spondylitis 
associated with psoriasis, juvenile onset spondyloarthropathy and undifferentiated spondyloarthropathy. 
Distmguishing features include sacroileitis with or without spondylitis: mflainmatory asymmetric arthritis: 
association with HLA-B27 (a serologically defined allele of the HLA-B locus of class 1 MHC); ocular 
inOammation. and absence of autoantibodies associated with other rheumatoid disease. The ceil most 
implicated as key to induction of the disease is the CD8+ T lymphocyte, a cell which targets antigen presented 
by class 1 MHC molecules. CD8+ T cells may react against the class I MHC allele HLA-B 2 7 as if it were a 
foreign peptide expressed by MHC class 1 molecules, it has been hypothesized that an epitope of HLA-B27 
may mimic a bacterial or other microbial antigenic epitope and thus induce a CD8+ T cells response. 

Systemic sclerosis (scleroderma) has an unknown etiology. A hallmark of the disease is induration of 
the skin; likely this is induced by an active inflammatory process. Scleroderma can be localized or systemic: 
vascular lesions are common and endothelial cell injury in the microvascular^ is an early and important event 
in the development of systemic sclerosis: the vascular injury may be immune mediated. An immunologic basis 
is implied by the presence of mononuclear cell infiltrates in the cutaneous lesions and the presence of ami- 
nuclear antibodies in many patients. ICAM-l is often unregulated on the cell surface of fibroblasts in skin 
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lesions suggesting that T cell interaction with these cells may have a role m the pathogenesis of the disease. 
Other organs involved inc l u de: the gastrointestinal trace smooth muscle atrophy and fibrosis resulting in 
abnormal peristalsis/mou'liiy. kidney: concentric subepithelial intimai proliferation affecting small arcuate 
and interlobular arteries with resultant reduced renal conical blood flow, result* in proteinuria, azotemia and 
hypertension: skeletal muscle: atrophy, interstitial fibrosis: inflammation; lung: interstitial pneumonitis and 
interstitial fibrosis: and heart contraction band necrosis, scarring/fibrosis. 

Idiopathic inflammatory myopathies including dermatomyositis. polymyositis and others am disorders 
of chronic muscle inflammation of unknown etiology resulting in muscle weakness. Muscle 
injury/inflamniation is often symmetric and progressive. Autoantibodies are associated with most forms. 
These myositi«pecific autoantibodies are directed against and inhibit the function of components, proteins 
and RNA's. involved in protein synthesis. 

Sjogren* syndrome is due to immune-mediated inflammation and subsequent functional destruction 
of the tear glands and salivary glands. The disease can be associated with or accompanied by inflammatory 
connective tissue diseases. Trie disease is associated with autoantibody production against Ro and La antigens, 
both of which are small RNA-protein complexes. Lesions resuli in kcratoconiunctivitis sicca, xerostomia, with 
other manifestations or associations including bilary cirrhosis, peripheral or sensory neuropathy, and palpable 
purpura. 

Systemic vasculitis arc diseases in which the primary lesion is inflammation and subsequent damage 
to blood vessels which results in ischemia/necrosis/degeneration to tissues supplied by the affected vessels and 
eventual end-organ dysfunction in some cases. Vasculitides can also occur as a secondary lesion or sequelae to 
other inmune-innammatory mediated diseases such as rheumatoid arthritis, systemic sclerosis, etc. 
particularly in diseases also associated with the formation of immune complexes. Diseases in the primary 
systemic vasculitis group include: systemic necrotizing vasculitis: polyarteritis nodosa, allergic angiitis and 
granulomatosis, polyangiitis: Wegener's granulomatosis: lymphomatoid granulomatosis: and giant cell arteritis. 
Miscellaneous vasculitides include: mucocutaneous lymph node syndrome (MLNS or Kawasaki's disease), 
isolated CNS vasculitis, Behefs disease, thromboangiitis obliterans (Buerger's disease) and cutaneous 
necrotizing venuiitis. The pathogenic mechanism of most of (he types of vasculitis listed is believed to be 
primarily due to the deposition of immunoglobulin complexes in the vessel wall and subsequent induction of 
an inflammatory response either via ADCC. complement activation, or both. 

Sarcoidosis is a condition of unknown etiology which is characterized by the presence of epithelioid 
granulomas tn nearly any tissue in the body; involvement of the lung is most common. The pathogenesis 
involves the persistence of activated macrophages and lymphoid cells at sites of the disease with subsequent 
chronic sequelae resultant from the release of locally and systemicaUy active products released by these cell 
types. 

Autoimmune hemolytic anemia including autoimmune hemolytic anemia, immune pancytopenia, and 
paroxysmal nocture! hemoglobinuria is a result of production of antibodies that react with antigens expressed 
on the surface of red blood cells (and in some cases other blood cells including platelets as well) and is a 
reflection of the removal of those antibody coated cells via complement mediated lysis and/or ADCC/Fc- 
recepror-mrdiared ""xhmm mg 
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In autoimmune thrombocytopenia including thrombocytopenic purpura, and mumme-mediated 
thrombocytopenia in other clinical settings, platelet destruction/ removal occurs as a result of either antibody or 
complement attaching to platelets and subsequent removal by complement lysis, ADCC or FC-recepxor 
mediated mechanisms 

Thyroiditis including Grave's dis e ase. Hashimoto's thyroiditis, juvenile lynnihocytic myroiditis. and 
atrophic thyroiditis, are the result of an autoimmune response against thyroid antigens with production of 
antibodies that react with proteins present in and often specific for the thyroid gknH Experimental models 
exist including spontaneous models: rats (BUF and BB rats) and chickens (obese chicken strain); inducible 
models: immunization of animals with either thyroglobulin. thyroid microsomal antigen (thyroid peroxidase). 

Type I diabetes meilitus or insulin-dependent diabetes is the autoinunune destruction of pancreatic 
islet P cells: this destruction is mediated by auto-anubodies and auto-reactive T cells. Antibodies to insulin or 
the insulin receptor can also produce the phenorype of insulin-non-responsiveness. 

Immune mediated renal diseases, including glomerulonephritis and tububmterstitial nephritis, arc the 
result of andbody or T lymphocyte mediated injury to renal tissue either directly as a result of the production of 
autoreactive antibodies or T cells against renal antigens or indirectly as a resuh of the deposition of antibodies 
and/or immune complexes in the kidney that are reactive against other, non-renal amigens. Thus other 
immune-mediated diseases that result in the formation of immune-complexes can also induce immune 
mediated renal disease as an indirect sequelae. Both direct and indirect immune mechanisms result in 
inOammaiory response that produces/induces lesion development in renal tissues with resultant organ function 
impairment and in some cases progression to renal failure. Both humoral and cellular immune mechanisms can 
be involved in the pathogenesis of lesions. 

Denryelinanng diseases of the central and peripheral nervous systems, including Multiple Sclerosis: 
idiopathic demyeiinating polyneuropathy or Guillain-Barre syndrome: and Chronic Inflammatory 
Demyelinating Polyneuropathy, arc believed to have an autoimmune basis and result in nerve demyeiination as 
25 a result of damage caused to oligodendrocytes or to myelin directly. In MS there is evidence to suggest that 
disease induction and progression is dependent on T lymphocytes. Multiple Sclerosis is a demyelinating 
disease that is T lymphocyte-dependent and has either a relapsing-remining course or a chronic progressive 
course. The etiology is unknown: however, viral infections, genetic predisposition, environment, and 
autoimmunity all contnbute. Lesions contain infiltrates of predominantly T lymphocyte mediated, microglial 
cells and derating macrophages: CD4+T lymphocytes are the predominant cell type at lesions. The 
mechanism of ohgodendrocyte cell death and subsequent demyeiination is not known but is likely T 
lymphocyte driven. 

Inflammatory and Fibrotic Lung Disease, mcluding Eosinophilic Pneumonias; Idiopathic Pulmonary 
Fibrosis, and Hypersensitivity Pneumonitis may involve a deregulated inimune-mflarnniatory response. 
35 Inhibition of that response would be of therapeutic benefit 

Autoimmune or Immune-mediated Skin Disease including Bullous Skin Diseases, Erythema 
Multiforme, and Contact Dermatitis are mediated by auto-anubodies, the genesis of which is T lymphocyte- 
dep e n de nt. 

Psoriasis is a T lymphocyte-mediated inflammatory disease. Lesions contain infiltrates of T 
lymphocytes, macrophages and antigen processing cells, and some neutrophils. 
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Allergic diseases, including asthma: allergic rhinitis atopic dermatitis: food hypersensitivity; and 
urticaria are T lymphocyte dependent. These diseases are predominantly mediated by T lymphocyte induced 
in flammati on. IgE mediated-inflammation or a combination of both. 

Transplantation associated diseases, including Graft rejection and Graft-Vetsus-Host-Disease 
(CVHD) are T lymphocyte-dependent: inhibition of T lymphocyte function is ameliorative. 
Other diseases in which intervention of the immune and/or inflammatory response have benefit are infectious 
disease including but not limited to viral infection (including but not limited to AIDS, hepatitis A, B. C. D. E 
and herpes) bacterial infection, fungal infections, and protozoal and parasitic infections (molecules (or 
derivatives-agonists) which stimulate the MLR can be utilized therapeutically to enhance the immune response 
to infectious agents), diseases of immunodeficiency (molecules/derivatives/agonists) which stimulate the MLR 
can be utilized therapeutically to enhance the immune response for conditions of inherited, acquired, infectious 
induced (as in HIV infection), or iatrogenic (ie.. as from chemotherapy) immunodeficiency), and neoplasia. 

It has been demonstrated that some human cancer patients develop an antibody and/or T lymphocyte 
response to antigens on neoplastic cells. It has also been shown in animal models of neoplasia that 
enhancement of the immune response can result in rejection or regression of that particular neoplasm 
Molecules that enhance the T lymphocyte response in the MLR have utility in vhv in enhancing the immune 
response against neoplasia. Molecules which enhance the T lymphocyte proliferate response in the MLR (or 
small molecule agonists or antibodies that affected the same receptor in an agonistic fashion) can be used 
therapeutically to treat cancer. Molecules that inhibit the lymphocyte response in the MLR also function in 
mm during neoplasia to suppress the immune response to a neoplasm: such molecules can either be expressed 
by the neoplastic cells themselves or their expression can be induced by the neoplasm in other cells. 
Antagonism of such inhibitory molecules (either with antibody, small molecule antagonists or other means) 
enhances immune-mediated tumor rejection. 

Additionally, inhibition of molecules with proinflammatory properties may have therapeutic benefit in 
rcperfusion injury: stroke: myocardial infarction: atherosclerosis: acute lung injury: hemorrhagic shock bum: 
sepsis-septic shock: acute tubular necrosis: endometriosis degenerative joint disease and pancreatic 

The compounds of the present invention. e . s .. polypeptides or antibodies, are administered to a 
mammal, preferably a human, in accord with known methods, such as intravenous administration as a bolus or 
by continuous infusion over a period of time, by intramuscular, intraperitoneal intracerobrospinai 
subcutaneous, intra-articnlar. intrasynoviaL intrathecal oraL topical or inhalation (intranasal intrapulmonaty) 
routes. Intravenous or inhaled administration of polypeptides and antibodies is preferred. 

In hmmmoadjuvant therapy, other therapeutic regimens, such administration of an anti-cancer agent, 
may be combined with the administration of the proteins, antibodies or compounds of the instant invention. 
For example, the patient to be treated with a the immunoadjuvnm of the invention may also receive an anti- 
cancer agent (cheaotherapeutic agent) or radiation therapy. Preparation and dosing schedules for such 
chemotherapeuuc agents may be used according to manufacturers' instructions or as determined empirically by 
the skilled practitioner. Preparation and dosing schedules for such chemotherapy are also described in 
Chemotherapy Service Ed.. M.C. Perry, Wflliams & Wilkins. Baltimore. MD (1992). The chemotherapeutic 
agent may precede, or follow administration of the immmioadjuvant or nay be given simulant 
Additionally, an aim-oestrogen compound such as tamoxifen or an anti-progesterone such as onapristone (sec 
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EP 616812) may be given in dosages known for such molecules. 

It may be desirable to also administer antibodies agains t other immune associated or tumor 

associated antigens, such as antibodies which bind to CD20, CD1 la. CD18. ErbB2. EGFR, ErbB3. ErbB4, or 
vascular endothelial factor ( VEGF). Alternatively, or in addition, two or more antibodies binding the same or 
two or more different antigens disclosed herein may be coadministered to the patient. Sometimes, it may be 
beneficial to also administer one or more cytokines ro the patient In one embodiment, the PRO polypeptides 
are coadministered with a growth inhibitory agent For example, the growth inhibitory agent may be 
administered first, followed by a PRO polypeptide. However, simultaneous administration or administration 
first is also contemplated. Suitable dosages for the growth inhibitory agent are those presently used and may 
be lowered due to the combined action (synergy) of the growth inhibitory agent and the PRO polypeptide. 

For the treatment or reduction in the severity of immune related disease, the appropri ate dosage of an 
a compound of the invention will depend on the type of disease to be treated, as defined above, the severity and 
course of the disease, whether the agent is administered for preventive or therapeutic purposes, previous 
therapy, the patient's clinical history and response to the compound, and the discretion of the attending 
physician. The compound is suitably administered to the patient at one time or over a series of treatments. 

For example, depending on the type and seventy of the disease, about 1 ug/kg to 15 mg/kg (e.g., 0.1- 
20 mg.'kc; of polypeptide or antibody is an initial candidate dosage for adminisnation to the patient, whether, 
for example, by one or more separate administrations, or by continuous infusion. A typical daily dosage might 
range from about I ug/kg to 100 mg/kg or more, depending on the factors mentioned above. For repeated 
administrations over several days or longer, depending on the condition, the treatment is sustained until a 
desired suppression of disease symptoms occurs. However, other dosage regimens may be useful The 
progress of this therapy is easily monitored by conventional techniques and assays. 
O. Articles of Manufacture 

In another embodiment of the invention, an article of manufacture containing materials (e.g., 
comprising a PRO molecule) useful for the diagnosis or treatment of the disorders described above is provided. 
The article of manufacture comprises a container and an instruction. Suitable containers include, for example, 
bottles, viais. syringes, and test tubes. The containers may be formed from a variety of materials such as glass 
or plastic. The container holds a composiuon which is effective for diagnosing or treating the condition and 
may have a sterile access port (for example the container may be an intravenous solution bag or a vial having a 
stopper pierceable by a hypodermic injection needle). The active agent in the composition is usually a 
polypeptide or an antibody of the invention. An instruction or label on. or with, the container 

indicates that the composiuon is used for diagnosing or treating the condition of choice. The arncle of 
manufacture may further comprise a second container comprising a pharmaceutical! y-acccptable buffer, such 
as phosphate-buffered saline. Ringer's solution and dextrose solution It may further include other materials 
desirable from a commercial and user standpoint, including other buffers, diluents, filters, needles, syringes, 
and package inserts with instructions for use. 

p - Diagnosis an d Prognosis of Immune Related Disease 

Cell surface proteins, such as proteins which are overexpressed in certain immune related diseases, are 
excellent targets for drug candidates or disease treatment The same proteins along with secreted proteins 
encoded by the genes amplified in immune related disease states find additional use in the diagnosis and 
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prognosis of these diseases. For example, antibodies directed against the protein products of genes amplified 
in multiple sclerosis, rheumaioid arthritis, or another immune rela ted disease, can be used as diagnostics or 
prognostics. 

For example, antibodies, including amibody fragments, can be used to qualitatively or quantitatively 
detect the expression of proteins encoded by amplified or overexpressed genes ("marker gene products'). The 
antibody preferably is equipped with a detectable, e.g.. fluorescent label, and binding can be monitored by light 
microscopy, flow cytometry, fluorimetry, or other techniques known in the art These techniques are 
particularly suitable, if the overexpressed gene encodes a ceil surfcee protein Such binding assays are 
performed essentially as described above. 

In Jito detection of anribody binding to the marker gene products can be performed, for example, by 
immuiiotluorescence or immunoelectron microscopy. For this purpose, o histological specimen is removed 
from the patient, and a labeled antibody is applied to it, preferably by overlaying the antibody on a biological 
sample. This procedure also allows for determining the distribution of the marker gene product in the tissue 
examined. It will be apparent for those skilled in the an that a wide variety of histological methods are readily 
1 5 available for in situ detection. 

The following examples arc offered lor illustrative purposes only, and are not intended to limit the 
scope of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by 
reference in their entirety. 
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EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection. Manassas. 
25 VA. Unless otherwise noted, the present invention uses standard procedures of recombinant DNA technology, 
such as those described hereinabove and in the following textbooks: Sambrook et ai. Molecular Cloning: A 
Laboraton- Manual Cold Spnng Harbor Press N.Y.. 1989; Ausubel et at.. Current Protocols in Molecular 
Biology. Green Publishing Associates and Wiley mterscience, N.Y., 1989; Innis et ai.. PCR Protocols; A 
Guide to Methods and Applications. Academic Press, inc. N.Y., 1990; Harlow et aL Antibodies: A laboratory* 
Manual. Cold Spring Harbor Press, Cold Spring Harbor. 1988; Gait, MX Oligonucleotide Synthesis. IRL 
Press, Oxford, 1984; RJ. Freshney, Animal Cell Culture, 1987; Coligan et a!.. Current Protocols in 
Immunoiogy, 1991. 
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EXAMPLE 1 

35 Isolation of cDNA clones Encoding Human PRO200, PRO204, PR0212, PR02I6. PR0226. PRO240, 
PR0235. PR0245, PR0172, PR0273, PR0272, PR0332, PR0526. PRO70I. PR0361, PR0362. PR0363. 
PR0364, PR0356, PR0531 . PR0533, PRO1083, PR0865, PRO770, PR0769, PR0788. PROl 1 14. PRO1007. 
PROI184. PRO1031, PR01346, PROll 55. PROI250, PR013I2, PROl 192, PR01246. PROI283, PROl 195, 
PR01343, PROI418. PR01387, PRO14I0, PR01917, PRD1868, PRO205, PR021, PR0269, PR0344, 

40 PR0333. PR0381. PRO720, PROS66. PRO840, PR0982. PRO** PROl 159, PR01358. PR01325. 
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PR01338. PR01434. PR04333. PRO4302. PRO4430 and PR05727 polypeptide. 

Various lechniques were employed for isolating the cDNA clones described below. A general 
description of the methods employed follows immediately hereafter, whereas the details relating the specific 
5 sequences isolated is recited separately for each native sequence. It is understood that the actual sequences of 
the PRO polypeptides are those which are contained within or encoded by the clone deposited with the ATCC - 
and that in the in event of any discrepancy between the sequence deposited and the sequence disclosed herein, 
the sequence of the deposit is the true sequence 

10 BCD Homology:. 

The extracellular domain (BCD) sequences (including the secretion signal sequence, if any) from 
about 950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. 
The EST databases included public EST databases {e.g. t GenBank), a private EST database (UFE5EQ* Incyte 
Pharmaceuticals, Palo Alio. CA), and proprietary ESTs from Genentech. The search was performed using the 

15 computer program BLAST or BLAST2 [Altschul ct aL Methods in Enzymofogx: 266: 460-480 (1996)] as a 
comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons 
resulting tn a BLAST score of 70 (or in some cases. 90) or greater that did not encode known proteins were 
clustered and assembled into consensus DNA sequences with the program "primp" (Phil Green. University of 
Washington. Seattle* Washington). 

20 Using various ESTs. drawing from both public and private databases, a consensus DNA sequence was 

assembled. Oligonucleotides were then synthesized to identify by PCR a cDNA library that contained the 
sequence of interest and for use as probes to isolate a clone encoding the particular native sequence PRO 
polypeptide identified herein. 

In order to screen several libraries for a source of a full-length, native sequence clone. DNA from the 

25 libraries was screened by PCR amplification with the PCR primer pair identified below. A positive library was 
then used to isolate clones encoding the particular native sequence PRO polypeptide using the probe 
oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from various human tissue libraries, 
including. e.g.. fetal lung, fetal liver, fetal brain, small intestine, smooth muscle cells, etc. The cDNA libraries 

30 used to isolated the cDNA clones were constructed by standard methods using commercially available reagents 
such as those from Invurogen, San Diego, CA The cDNA was primed with oligo dT containing a Not! site, 
linked with blunt to Sail hemucinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, 
and cloned in a defined orientation into a suitable cloning vector (such as pRKB; pRKSB is a precursor of 
pRK5D that does not contain the Sfil site; see. Holmes et a/.. Science. 253:1278-1280 (1991)) in the unique 

35 Xhol and NotI sites. The clones were sequenced using known and readily available methodology. 

Amylase yeast screen: 

I- Preparation of oligo dT primed cDNA library 

mRNA was isolated from various tissues (e.g.. such as those indicated above under the ECD homology 
40 procedure) using reagents and protocols from Invitrogen, San Diego, CA (Fast Track 2). This RNA was used to 
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generate so oligo dT primed cDNA library in the vector pRK5D using reagenxs and protocols from Life 
Technologies, Gaithersburg. MD (Super Script Plasmid System). In this procedure, the double st randed cDNA 
was sized to greater than 1000 bp and me Sail/Not! Imkered cDNA was cloned into XhoI/NotI cleaved vector. 
pRK5D is a cloning vector that has an sp6 transcription initiation site followed by an SCI restriction enzyme 
5 site preceding the XhoI/NotI cDNA cloning sites. 

2. Preparation of random primed cDNA library 

A secondary cONA library was generated in order to preferentially represent the S* ends of the 
primary cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was 

0 used to generate a random primed cDNA library in the vector pSST-AM Y.O using reagents and protocols from 
Life Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded 
cDNA was sized to 500-1000 bp, linkered with blunt to NotI adaptors, cleaved with Sfil. and cloned into 
Sfil/Notl cleaved vector. pSST-AMY.O is a cloning vector that has a yeast alcohol dehydrogenase promoter 
rjreceding the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion 

5 signal) followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus. cDNAs cloned 
into this vector that are fused in frame with amylase sequence will lead to the secretion of amylase from 
appropriately transtected yeast colonies. 



3. Transformation and Detection 

20 DNA from the library described in paragraph 2 above was chilled on ice to which was added 

clectrocompetem DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then 
electroporated as recommended by the manufacturer. Subsequently. SOC media (Life Technologies. 1 ml) was 
added and the mixture was incubated at 37«C for 30 minutes. The transformants were then plated onto 20 
standard 150 mm LB plates containing amptcillm and incubated for 16 hours (37°Q. Positive colonies were 

25 scraped ofi" the plates and the DNA was isolated from the bacterial pcUet using standard protocols, e.g.. Cisco- 
gradient. The purified DNA was then carried on to the yeast protocols below. 

The yeast methods were divided into three categories: (1) Transfbrmation of yeast with the 
plasmid/cDNA combined vector, (2) Detection and isolation of yeast clones secreting amylase: and (3) PCR 
amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and 

30 further analysis. 

The yeast strain used was HD56-SA (ATCC-90785). This strain has the following genotype: MAT 
alpha, ura3-52, leu2-3, Ieu2-U2, bis3-tl. his3-15, MAL\ SUC\ GAL*. Preferably, yeast mutants can be 
employed that have deficient post-translational pathways. Such mutants may have translocation deficient 
alleles in sccl 1 , sec72.sec62. with truncated secl\ being most preferred. AhenBtrvely. antagonists (including 
amisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins 
implicated in this post translation pathway (e.g.. SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) or 
the complex formation of these proteins may also be preferably employed in combination with the amylase- 
expressing yeasL 

Tnmsformation was performed based on the protocol outlined by Gierx ct aL. NucL Add. Res,. 2D: 1425 
(1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml) and 
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grown overnight at 30«C. The YEPD broth was prepared as described in Kaiser « al. Methods in Teas, 
Genetics, Cold Spring Harbor Press. Cold Spring Harbor. NY. p. 207 (19*4). The' overnight culture was then 
diluted to about 2 x 10* cells/ml (appro*. OD w - 0.1) into fresh YEPD broth (500 ml) and regrown to I x 10 7 
celk/ral (approx. ODmo-0.4-0.5). 

The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a 
Sorval GS3 rotor at 5.000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile 
water, and centrimged again in 50 ml falcon tubes at 3.500 rpm in a Beekman GS-6KR centrifuge. The 
supernatant was discarded and the cells were subsequently washed with LiAc/TE (10 ml 10 mM Tris-HCI. 1 
mM EDTA pH 7.5. 1 00 mM L120OCCH3). and resuspended into LiAc/TE (2 J ml). 

Transformauon took place by mixing the prepared cells (100 pi) with freshly denatured single 
stranded salmon testes DNA (Lofstrand Labs. Gaithersburg, MD) and transforming DNA (I pg, vol. < 10 pi) 
in microfoge tubes. The mixture was mixed briefly by vortex ing, then 40% PEG/TE (600 pi 40% polyethylene 
grycoWOOO, 10 mM Tris-HCI, 1 mM EDTA 100 mM Li2Ac, pH 7.5) was added. This mixture was gently 
mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42°C for 15 
nunutcs. and the reacuon vessel centnluged in a microfoge at 12.000 rpm tor 5-10 seconds, decanted and 
resuspended imo TE (500 pi. 10 mM Tris-HCI. I «M EDTA pH 7.5) followed by recentrifogation. The cells 
were men diluted into TE (1 ml) and aliquots (200 pi) were spread onto the selective media previously 
prepared in 150 mm growth plates (VWR). 

Alternatively, instead of multiple small reactions, the transformauon was performed using a single, 
large scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as 
described in Kaiser ei al.. Method, in Yeast Genetics. Cold Spring Harbor Press. Cold Spring Harbor. NY, p. 
208-210 (1994). Transformams were grown at 30°C for 2-3 days. 

The detection of colonies secreting amylase was performed by including red starch in the selective 
growth media. Starch was coupled to the red dye (Reactive Red-120. Sigma) as per the procedure described 
by Biely e,al..Ami Biochem.. 172:176-179 (1988). The coupled starch was incorporated into the SCD-Ura 
agar plates at a final concemranon of 0.15% (w,vj. and was buffered with potass.um phosphate to a pH of 7.0 
(50-100 mM final concentration). 

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in 
order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase 
secretion were detected by direct incorporation of red starch imo buffered SCD-Ura agar. Positive colonies 
were detennined by their ability to break down starch resulting in a clear halo around the positive colony 
visualized directly. 

Isolation and sequencing by standard techniques identified a yeast EST fragment which served as the 
35 basis for additional database mining as described below. 

4. Assembly 

The yeast EST fragment identified above was used to search various expressed sequence tag (EST ) 
databases. The EST databases included public EST databases (e.g.. GenBank, Merck/Wash U) and a 
propriety EST DNA database (LIFESEQ* Incyte Pharmaceuticals. Palo Alto. CA). The search was 
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performed using the computer program BLAST or BLAST2 (AltshuJ el aL Methods in Enzymology 
266:460-430 (1996)) as a comparison of the ECD protein sequences to a 6 frame translation of the EST 
sequence. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into, consensus DNA sequences with the program 
"phrap" (Phil Green. Uiuversny of Washington, Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. The 
consensus DNA sequence was extended using repeated cycles of BLAST and phrap to extend the consensus 
sequence as far as possible using the sources of EST sequences discussed above as well as EST sequences 
proprietary to Genentech. 

Based on this consensus sequence, ougonucleotides were synthesized: I) to identify by PCR a cDNA 
library that contained the sequence of interest, and 2) for use as probes to isolate a clone encoding the 
particular PRO polypeptide. In order to screen several libraries for a full-length clone. DNA from the libraries 
was screened by PCR amplification, as per Ausubel et aL Current Protocols in Molecular Biology, with the 
PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest using the 
1 5 probe oligonucleotide and one of the primer pairs. 

RNA for construction of the cDNA libraries was isolated from various human tissues. The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using cornmercially available 
reagents such as those from Invitrogen. San Diego, CA The cDNA was primed with oligo dT containing a 
Not! site, linked with blunt to Sail hemtkinased adaptors, cleaved with NotL sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD: 
pRK5B is a precursor of pRK5D that does not contain the Sfil site: Holmes et aL Science. 253: 1278-1280 
( 199 1 )) in the unique Xhol and NotI sites. 
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Signal algorithm: 

A proprietary signal sequence finding algorithm developed by Genentech. Inc was used upon 
Expressed Sequence Tags (ESTs) and on clustered and assembled EST fragments from public (e.g.. Cenflank) 
and/or private fLifeseq\ Incyre Pharmaceuticals, inc.. Palo Alto. CA) databases. The signal sequence 
algorithm computes a secretion signal score based on the character of the DNA nucleotides surrounding the 
first and optionally the second methionine codecs) (ATG) at the 5'-cnd of the sequence or sequence fragment 
under consideration The nucleotides following the first ATG must code for at least 35 unambiguous amino 
acids without any stop codons. If the first ATG has the required amino acids, the second is not examined. If 
neither meets the requirement, the candidate sequence is not scored. In order to determine whether the EST 
sequence contains an authentic signal sequence, the DNA and corresponding amino acid sequences 
surromuimg the ATG codon are scored using a set of seven sensors (evaluation parameters) known to be 
associated with secretion signals. 

The above procedure resulted in the identification of EST sequences which were compared to a 
variety of expressed sequence tag (EST) databases which included public EST databases (e.g.. GenBank) and a 
proprietary EST DNA database (UFESEQ* Incyte Pharmaceuticals, Palo Alto. CA). The homology search 
was performed using the computer program BLAST or BLAST2 (AltshuJ et aL. Methods in Enzymology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
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that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program -phrap" (Phil Green, University of Washington, Seattle, Washington). This resulted in the 
identification of additional EST sequences which either corresponded to full-length clones, which were 
examined and sequenced or served as a template for the creation of cloning oligonucleotides which were then 
used to screen various tissue libraries resulting in isolation of DNA encoding a native sequence PRO 
polypeptide. 

A. Isolation of cDNA clones Encoding Human PRQ2QQ ( UNOl 74) 

Probes based on an expressed sequence tag (EST) identified from the Incyte Pharmaceuticals database 
due to homology with VEGF were used to screen a cDNA horary derived from the human glioma cell line 
G61. Screening may be conducted in a manner similar to the procedure disclosed elsewhere in this application. 
In particular, Incyte Clone -INC1302516" was used to generate the following four probes: 
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y - A CTTCTCACTGTCCATAAOOG.r (SEQIDNOJ) 

5'-GAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTC-3' (SEQ ID NO-4) 

y-CACCACAGCGTTTAACCAGG-3' (SEQ m N£ _ 

5*-ACAACAGGCACAGTTCCCAC-3' (SEQ ID NO: 6) 

Nine positives wens identified and characterized. Three clones contained (he fall coding region and 
were identical in sequence. Partial clones were also identified from a fetal lung hW and were identical with 
Ihe glioma-derived sequence with the exception of one nucleotide change which did not alter the encoded 
amino acid. 

For mammalian protein expression, the entire open reading frame (ORF) was cloned into a 
CMV-based expression vector. An epitope-tag (FLAG. Kodak) and Hisudine-tag (His8) were insened 
between the ORF and stop codon. UNQ174-His8 and UNQI74-FLAG were transfectcd into human embryonic 
tadney 293 cells by SuperFeet (Qiagen) and pulse-labeled for 3 hours with [»S)m«bjonme and ["CJcysteine 
Both epitope-tagged proteins co-migrate when 20 microhm of 15-fold concentrated senna-free conditioned 
med,um were electmphoresed on a polyacrylamide gel (Novex, in sodium dodecvl sulfa* sample buffer 
(SDS-PAGE). The UNQl74-IgG expression plasmid was consulted by cloning the ORF in front of the 
human Fc (IgG) sequence. 

The UNOl 74-! B G plasmid was co^ransfected with BaculogoW Baculovirus DNA (Phanningen) using 
Lipofcctin (GibcoBRL) into 10 s Sf9 cells grown „ Hink's TNM-FH medium (JRH Biosciences) supplemented 
wnh )0% feul bovine serum. Cells were incubated for 5 days at 28°C. The supernatant was harvested and 
subsequently used for the fits, viral amplification by infecting Sf9 cells a, an approximate multiplicity of 
infection (MOD of ,0. Cells were incubated for 3 days, then supernal harvested, and expression of the 
recommnan, plasmid detemnned by binding of I ml of supematent to 30 pi of Protein-A Sephamse CL-.B 
beads (Pharmacia) followed by subsequent SDS-PAGE analysis. The first amplification supernatant was used 
.0 mfect a 500 ml spinner culture of S<9 cells grown in ESF-921 medium (Expression Systems LLC) at an 
approbate MOI of 0... Cells were treated as above, except harvested supernatant was sterile filtered. 
Spectfic protem was purified by binding to Protein-A Sepharose 4 Fast Flow (Phamacia) column. 

The entire nucleotide sequence of the identified clone DNA29101 is shown in Figure I (SEQ ID 
NO:.). Clone DNA29101 (SEQ ID NO:l ) contains a single open reading frame with an apparent translation 
■nutation site a, nucleotide resides 285-287 and endmg at the stop codon (TAG) found a, nucleotide positions 
1320-1322 (Figure 1. SEQ ID NO:l). as indicated by bolded underline. The predicted PRO200 polypeptide 
precursor (Le.. UNQ174. SEQ ID NO:2) is 345 amino acids in length, has a calculated molecular weight of 
39029 daW a pi of 6.06 and is shown in Figure 2 (SEQ ID NO:2). Potential N-g.ycosylation sites are at 
amino acid residues 25, 54 and 254. CUB domains are at amino acid residues 52-65, 118-125 and 260-273. 

A cDNA containing DNA encoding UNQ174 (SEQ ID NO:2) has been deposited with the ATCC on 
March 5. 1998 and has been assigned deposit number 209653. 

B< Isolation Of cDNA clones Encn dfaig Humm, Pt»02Q4 (UNOl 78) 

An expressed sequence tag (EST) DNA database (UFESEQ", Incyte Pharmaceuticals, Palo Aim, CA) 
was searched and an EST was identified. Human fetal retina cDNA libraries were screened with PCR 
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oligonucleotide primers and confirmed by hybridization with synthetic oligonucleotide probe which was based 
upon the EST sequence. 
hybridization probe : 

5^GCATGCAGCAGCTGGACATTTGCGAGGGCTnTGCTGGCTG-3 f (SEQ ID NO:7) 
forward PCR primer 

S-CTGCTGCAGAGTTGCACGAACO' (SEQIDNO:8) 
reverse PCR primer 1 : 

5 1 -CAGTTGTTGTTGTCACAGAGAAG-3' (SE q rj) NO:9) 

reverse PCR primer 2 : 

S'-AGTTCGTGCAACTCTGCAGCAG-y (SEQ NO: 10) 

A cDNA clone was identified and sequenced in entirety. The entire nucleotide sequence of die 
identified clone DNA3087I is shown in Figure 3 (SEQ ID NO: 1 1). Clone DNA30871-1 157 (SEQ ID NO:l 1 ) 
contains a single open reading frame with an apparent translation initiation site at nucleotide positions 376-378 
and ending at the stop codon (TAA) found at nucleotide positions 1498-1500 (Figure 3: SEQ ID NO:l I), as 
indicated by bolded underline. The predicted PRO204 polypeptide precursor (i.e.. UNQ178. SEQ ID NO:I2) 
is 374 amino acids long, has a calculated molecular weight of 39.285 daltons. a pi of 6.06 and is shown in 
Figure 4. A cDNA containing DNA encoding UNQ178 (SEQ ID NO:l2> has been deposited with the ATTC 
on October 16, 1997 and has been assigned deposit number 209380. 

C Isolation of c DNA clones Encoding Human PRQ212 fUNQ lKfi) 

Use of the ECD homology procedure described above from a human fetal lung library resulted in the 
identification of the full-length DNA sequence for DNA30942 (Fig. 5; SEQ ID NO:l3) and the derived protein 
sequence UNQ 1 86 (Fig. 6; SEQ ID NO: 14). 

The PCR primers (forward and reverse) and probes used in the procedure were the following: 
forward primer: 5'-CACGCTGGTTTCTGCTTGG AG-3* (SEQ ID NO: 15) 

reverse primer: 5'- AGCTGGTGC ACAGGGTGTCATG-3' (SEQ ID NO: 1 6) 

hybridization probe: (SEQIDNO:17) 
y-CCCAGGCACCrrCTCAGCCAGCCAGCAGCTCCAGCTCAGAGCAGTGCCAGCCC-3' 

The entire nucleotide sequence of DNA30942 is shown in Figure 5 (SEQ ID NO:13). Clone 
DNA30942 (SEQ ID NO: 13) contains a single open reading frame with an apparent translation initiation site at 
nucleotide positions 101-103 and ending at the stop codon (TGA) at positions 1001-1003 (Fig. 5; SEQ ID 
NO:I3K as indicated in bolded underline. The predicted PR0212 polypeptide precursor of Fig. 6 (SEQ ID 
NO:14) is 300 amino acids long, has a calculated molecular weight of 32680 daltons and a pi of 8.70. It is 
believed that the PR0212 sequence of Fig. 6 (SEQ ID NO:14) lacks a transmembrane doniain. It is also 
believed that amino acids 1 to 215 of Fig. 6 (SEQ ID NO:14) represents an ECD which includes four cysteine 
rich domains (CRDs). A cDNA clone containing DNA30942 (SEQ ID NO: 1 3) has been deposited with ATCC 
(identified as DNA30942-1134) on September 16 J997 and has been assigned ATCC deposit no. 209254. 
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D. Isolation of cDNA clones Encoding Human PRQ216 (UNO 190) 

A procedure analogous to the one above for me isolation of PR0212 can be employed to isolate 
DNA33087 (SEQ ID NO:18) (Figure 7) which encodes the PR0216 polypeptide UNQI90 (SEQ ID 
NO:I9MFigure8). 

DNA33087 contains a single open reading frame with an apparent translation initiation site nucleotide 
residues 268-270 and ending at the stop codon (TAG) are residues 1531-1533 (Fig. 7. SEQ ID NO:18), as 
indicated by bolded underline The predicted PR0215 polypeptide precursor (/.e.. UNQI90, SEQ ID NO:l9) 
is 421 amino acids long, has a calculated molecular weight of 49492 'daltons and a pi of 531 (Fig. 8). 

Hydropathy analysis suggests the presence of a signal sequence at amino acid residues I to 20, 
tyrosine kinase phosphorylation sites at amino acid residues 268-274 and 300-306, and N-myristoyiation site 
residue 230-235. and leucine zippers at residues 146 to 167 and 217 to 238. Alternatively to iradiuonal 
isolation techniques, the DNA sequence is publicly available from GenBank as accession number AB0001 14 
which encodes Dayhoff protein AB000J 14 I. 

Alternatively still, the sequence is described in Ofmo et al.. Biochem. Biophys. Res. Commun. 228(2): 
41 MI4 (1996). A cDNA clone containing DNA33087 (identified as DNA33087-1 158) has been deposited 
with the American Type Culture Collection (ATCC) on September 16, 1997 and has been assigned ATCC 
Dep. No. 209381. 

£ Isolation of cDNA clones Encoding Human PRQ226 (UNC[7M)\ 

Use of the ECD homology procedure described above in a human fetal lung library resulted in the 
identification of the full-length DNA sequence for DNA33460 (Figure 9; SEQ ID NO:20) and the derived 
native sequence protein UNQ200 (SEQ ID NO:2 1 ). 

DNA33460 contains a single open reading frame with an apparent translation initiation site at 
nucleotide residues 62-64 and ending at the stop codon (TGA) at residues 1391-1393 (Fig. 9: SEO ID NO: 20), 
as indicated by bolded underline. The predicted PR0226 polypeptide precursor <«.. UNQ200. SEQ ID 
NO:2 1 ) is 443 amino acids long, has a calculated molecular weight of 49,39.1 daltons, a pi of 4.82 and is shown 
in Figure 10 as UNQ200 (SEQ ID NO:21). A cDNA clones conu^ning DNA33460 (SEQ ID NO:20), 
designated as DNA33460-1 166. has been deposited with the ATCC on October 16, 1997 and has been assigned 
ATCC deposit number 209376. 

The ougonucieotide sequences used in the above procedure were the following: 
28722.p(OU488) 

5^-TCTCrrGGACATAGACGAGTGCCGCTACCGCTACTGCCAGCACCGC-y 
28722.f (OU489) 

5'-AGGACTGCCATAACTTGCCTG-3* 
28722J-COU490) 

5 , -ATAGGAGTTGAAGCAGCGCTGC-3' 



(SEQ ID NO: 22) 
(SEQ ID NO: 23) 
(SEQ ID NO: 24) 
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F. Isolation of cDNA clones Encoding Human PRO240 (LfN0214) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
isolation of the foil-length DNA sequence for DNA34387 (Figure 11; SEQ ED NO:25) and the derived native 
sequence protein UNQ214 (SEQ ID NO:26). 

The entire nucleotide sequence of DNA34387 is shown in Figure 1 1 (SEQ ID NO:25). The clone 
DNA34387 contains a single open reading fame with an apparent translation imtnoon site at nucleotide 
positions 12-14 and ending at the stop codon (TGA) at nucleotide positions 699-701 (Fig. 11; SEQ ID NO:25) t 
as indicated by bolded underline. The predicted PRO240 polypeptide precursor (/.«., UNQ214, SEQ ID 
NO:26) is 229 amino acids long, has a calculated molecular weight of 24.689 daltons. a pi of 7.83 and is shown 
in Figure 12. A cDNA clone containing DNA34387 (SEQ ID NO:25) lias been deposited with ATCC on 
September I6 f 1997 and is assigned ATCC deposit no. 209260. 

The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above- 
described procedure were the following: 

forward PCR primer ^-TCAGCTCCACACTCTGATACTGCC-y <SEQIDNO:27) 
15 reverse PCR pnmer S'.TGCCTTrCTACGAGGCAGAGCTCC-r (SEQIDNO:28) 
hybridization probe: (SEQ 1D NO:29> 

5-GGACCCAGAAATGTGTCCTGAGAATGGATCTTGTGTACCTGATGGTCCAG-3' 

G. Isolation of cDNA clones Encoding Human PRQ235 fUNO209} 
20 Use of the ECD homology procedure described above in a human fetal liver library resulted in the 

isolation of the full-length DNA sequence for DNA35558 (Figure 13; SEQ ID NO30) and the derived 

PR0235 native sequence protein UNQ209 (Fig. 14, SEQ ID NO: 31). 

TTie entire nucleotide sequence of DNA35558 is shown in Figure 13 (SEQ ID NO:30). The 

DNA35558 clone shown in Figure 13 contains a single open reading frame with an apparent translation 
25 initiation site at nucleotide positions 667-669 and ending at the stop codon (TGA) at nucleotide positions 2323- 

2325. as indicated by bolded underline. The predicted PR0235 polypeptide precursor UNQ209. SEQ ID 

NO:3 1 ) is 552 amino acids long, has a calculated molecular weight of 6 1 .674 daltons and a pi of 6.95 (Figure 

14). A cDNA clone containing DNA35558 has been deposited with ATCC on October 16, 1997 and is 

assigned ATCC deposit no. 209374. 

Tlie PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

forward PCR primer 5*-TGGAATACCGCCTXXTGCAG-3' (SEQ ID NO:32) 

reverse PCR primer y-CrrCTGCCCTTTGGAGAAGATGGC-S' (SEQ ID NO:33) 

hybridization probe: 

35 5'-GGACTCACTGGCCCAGGCCTITCAATATCACCAGCCAGGACG AT-3' (SEQ ID NO:34) 
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H. Isolation ofcDNA Clonic Fn^ing Human PRQ24S (UNQilu^ 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
isolation of the full-length DMA sequence for DNA3S658 (Figure 15. SEQ ID N035) and the derived PR0245 
native sequence protein UNQ219 (Figure 16. SEQ ID NO:36). 

The PCR primers (forward and reverse) and hybridization probes synthesized for use with the above- 
described method were the following: 

forward PCR primer 5--ATCGTTGTGAAGTTAGTGCCCC-3' (SEQIDNO:37) 
reverse PCR primer 5*-ACCTGCGATATCCAACAGAATTG-3' (SEQ ID NO 38) 

bvbridization probe (SEQIDNO:39) 
5'-GGAAGAGGATACAGTCACTCTGGAAGTATTAGTGGCTCCAGCAGTTCC-3 , 

The entire nueleodde sequence of DNA35638 (SEQ ID NO:35) is shown in Figure IS. Clone 
DNA35638 contains a single open reading frame with an apparent translation initiation site at nucleotide 
positions 89-91 and ending at the stop eodon (TAG) at nucleotide positions 1025-1027 (Fig. 15; SEQ ID 
NO:35). The predicted PR0245 polypeptide precursor (i.e.. UNQ219, SEQ ID NO:36) is 312 amino acids 
long, has a calculated molecular weight of 34.554 dalums and a pi of 9.39 (Fig. 36). A clone containing 
DNA35638 (SEQ ID NO:35). designated as DNA35638-. 141. has been deposited with ATCC on September 
1 6. 1 997 and is assigned ATCC deposit no. 209265. 

Isolation of cDNA clones Encoding Human PRQ172 (UNQ146> 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
isolation of the full-length DNA sequence for DNA359I6 (Fig. 17; SEQ ID NO:40) and the derived PROI72 
native sequence protein UNQ146 (Fig. 1 8. SEQ ID NO:41). 

Clone DNA359I6 (SEQ ID NO:40) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 38-40 and ending at the stop codon (TAA) at nucleotide positions 2207- 
2209. as indicated by bolded underline in Fig. .7. The predicted PRO.72 polypeptide precursor (i.c. 
UNQI46: SEQ ID NO:4l) is 723 amino acids long, has a calculated molecular weight of 78.055 Gallons and a 
pi of 6.17 (Fig. 18). A cDNA clone containing DNA359I6 (SEQ ID NO:40) has been deposited with ATCC 
on October 28. 1997 (designated as DNA35916-1 161) and has been assigned ATCC deposit no. 209419. 

The oligonucleotide sequences used in the above procedure were the following: 
28765.p(OLI633) 

y-AAATCTGTGAATTCAGTGCCATGGACCTGTTGCGGACGGCCCTTGCTT-3' (SEQ ID NO:42) 
28765.f(OLI644) 

5"-GG ATCTCG AGAACAG CTACTCC-3' (SEQIDNO:43) 
2876S.r(OLI645) 

5--TCGTCCACGTTGTCGTCACATG-3- (S£Q 
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J - Isolation of cDNA clones En coding Human PRQ273 iUNO240) 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
isolation of the full-length DNA sequence for DNA39523 (Fig. 19; SEQ ID NO:45) and the derived PR0273 
native sequence protein UNQ240 (Fig. 20, SEQ ID NO:46). 

The PCR primers (forward and reverse) and hybridization probe synthesized were the following: 
forward PCR primer: 5MrAGCGCCCTCCCCATGTCCCTG-3' (SEQn>NO:47) 
reverse PCR primer: 5--TCCCAACTGGTTTGGAGTTTTCCC-3- (SEQ ID NO:48) 

hybridization probe: 

y-CTCCGGTCAGCATGAGGCTCCTGGCGGCCGCTGCTCCTGCTGCTG-3' (SEQ ID NO:49) 

Clone DNA39S23 (SEQ ID NO:45) contains a single open reading frame win, an apparent translation 
initiation site at nucleotide positions 167-169 and ending at the stop codon (TAG) at nucleotide positions 500- 
502 (Figure 19), as indicated by bolded underline. The predicted PR0273 polypeptide precursor 
UNQ240. SEQ ID NO:46) is 1 1 1 amino acids long, has a calculated molecular weight of 13,078 daltons and a 
pi of 10.37 (Figure 20). A cDNA clone including DNA39523 (SEQ ID N045) has been deposited with 
1 5 ATCC on October 31. 1997 and is assigned ATCC deposit no. 209424. 

K- Isolation ofcDNA clones Encoding Human PRQ272 OJN0239) 

Use of the ECD homology procedure described above in a human fetal lung tissue in combination 
with an m vivo cloning procedure using the probe oligonucleotide and one of the primer pairs resulted in the 
identification of the full length DNA sequence for DNA40620 (Fig. 21, SEQ ID NO:50) and the derived 
PR0272 native sequence protein UNQ239 (SEQ ID NOJ1). 

The forward and reverse PCR primers and hybridization probes synthesized and used to isolate the 
PR0272 encoding DNA sequences were the following: 

forward PCR primer (.fl): 5'-CGCAGGCCCTCATGGCCAGG-3* (SEQ ID NO:52) 

25 forward PCR primer (.G): 5'-GAAATCCTGGGTAATTGG-3' (SEQ ID NO:53) 

reverse PCR prim er: S'-GTGCGCGGTGCTCACAGCTCATC-y (SEQ ID NO:54) 

hybridization probe: 

5*-CCCCCCTGAGCGACGCTCCCCCATGATGACGCCCACGGGAACTTC-3' (SEQ ID N055) 

Clone DNA40620 (SEQ ID NO:50) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 35-37 and ending at the stop codon (TGA) at nucleotide positions 1020- 
1022 (Figure 21). as indicated by bolded underline. The predicted polypeptide precursor is 328 amino acids 
long (Rgure 22). has a calculated molecular weight of 37,493 daltons and a pi of 4.77. A cDNA clone 
containing DNA40620 (SEQ ID NO.50) has been deposited with ATCC on October 17. 1997 and is assigned 
ATCC deposit no. 209388. 
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L Isolation of cDNA clones En ding Human PRQ332 OJN0293) 

Use of the ECD homology procedure described above in a human fetal liver library resulted 
in the identification of the full-length DNA sequence for DNA40982 (Fig. 23, SEQ ID NO:56) and the derived 
PR0332 native sequence protein UNQ293 (Fig. 24, SEQ ID NO:57). 
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The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

5-^CATTGGCCGCGACACT7TGCC.3- (SEQ ID NO-.38) 

5'-OCGGCCACGCTCCTTOGAAATG-3' (SEQ m NQ . 59) 

5 5'-TGGAGGAGCTCAACCTCAGCTACAACCGCATCACCAGCCCACAGG-3" (SEQIDNO:60) 

The entire nucleotide sequence of DNA40982 (SEQ ID NO:56) is shown in Figure 23. Clone 
DNA40982 (SEQ ID NO:56) contains a single open reading frame with an apparent translation initiation site ax 
nucleotide positions 342-344 and ending at the stop codon (TAG) at nucleotide positions 2268-2270. as 
indicated in Figure 23 by bolded underline. The predicted PR0332 polypeptide precursor (i.e.. UNQ293. SEQ 
ID NO-.57. Fig. 24) is 642 amino acids long, and has a calculated molecular weight of 72,067. and a pi of 6.60. 
A cDNA clone containing DNA40982 (SEQ ID NCW6) (designated as DNA40982-1235) has been deposited 
with ATCC on November 7. 1997 and is assigned ATCC deposit no. 209433. 
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M- Isolation ofcDNA clones Encoding Human PROS26 (UNO330> 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
identification of the full-length DNA sequence DNA44I84 (Fig. 25. SEQ ID NO:6I) and the derived PROS26 
native sequence protein UNQ330 (Fig. 26. SEQ ID NO:62). 

The PCR primers (forward and reverse) and hybridization probes synthesized were the following: 
forward PCR primer 5*-TGGCTGCCCTGCAGTACCTCTACC-3" (SEQIDNO:63) 
20 reverse PCR primer 5'-CCCTGCAGGTCATTGGCAGCTAGG-3' (SEQIDNO:64) 
hybridization probe : (SEQ ^ 

5--AGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCACAC-3 1 . 

Clone DNA44I84 (SEQ ID NO:61) comains a single open reading frame with an apparent translation 
initiation sue at nucleotide pos.tions 514-516 and endin e at the stop codon (TGA) at nucleotide positions 1933- 
1935 (Figure 61). as indicated by bolded underline. The predicted PR0526 polypeptide precursor {i.c. 
UNQ330. SEQ ID NO:62) is 473 amino acids long (Figure 62). The UNQ330 (SEQ ID NO:62) protein shown 
in Figure 62 has an estimated molecular weight of about 50708 daltons and a pi of about 9.28. A cDNA clone 
containing DNA44I84 has been deposited with the ATCC on 26 March 1998 (under the designation 
DNA44184-13 19) and is assigned deposit number 209704. 

Analysis of UNQ330 (SEQ ID NO:62) revels that the signal peptide sequence is at about ammo acids 
1-26. A leucine zipper pattern is at about amino acids 135-156. A glycosaminoglycan attachment is at about 
amino acids 436439. N-giycosylation site, are at about amino acids 82-85. 179-182, 237-240 and 423426. A 
von Willebrand factor (VWF) type C domain(s) is found at about amino acids 41 M25. The skilled artisan 
can understand which nucleotides correspond to these amino acids based on the sequences provided herein. 

N- Isolation of cDNA clones Encoding Human PRQ7 01 fUNQ365) 

Use of the ECD homology procedure described above in a human fetal liver library insulted in the 
identification of the mil-length DNA sequence DNA44205 (Fig. 27. SEQ ID NO:66) and the derived PR0526 
native sequence protein UNQ365 (Fig. 28, SEQ ID NO:67). 
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The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

5* -GGCAAGCTACGGAAACGTCATCGTG-3' (SEQ ID NO:68) 

5^AAC<XCCGAG<XAAAAGATGGTCAC-3' (S EQ ID N069) 

5 5^TACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATCAGO» (SEQ ID NO:70) 

Clone DNA44205 (SEQ ID NO:66) coniains a single open reading frame (with an apparent translation 

initiation site at nucleotide positions 50-52 and ending at the stop codon (TAG) at nucleotide positions 2498- 

3000, as indicated by bolded underline in Figure 27. Tbt predicted PRO701 polypeptide precursor {l*. Fig. 

28. UNQ365, SEQ ID NO:67) is 816 amino acids long, and has a calculated molecular weight of 91.794 Da 
10 (pi: 5.88). A cDNA clone containing DNA44205 (SEQ ID NO:66) (designated as DNA44205-1 285) has been 

deposited with ATCC on March 31 . 1998 and is assigned ATCC deposit no. 209720. 

UNQ365 (SEQ ID NO:67) contains a potential signal peptide cleavage site at about amino acid 

position 25. There are potential N-glycosylation sites at about amino acid positions 83, 51 1, 716 and 803. The 

carboxylesterases rype-B signature 2 sequence is at about residues 125 to 135. Regions homologous with 
5 carboxylestcrase rype-B are also at about residues 54-74. 197-212 and 221-261. A potential transmembrane 

region corresponds approximately to amino acids 671 through about 700. The corresponding nucleic acids can 

be routinely determined from the sequences provided herein. 

O. Isolation ofc DNA clones Encoding Human PRQ361 (UNQ316) 

Use of the ECD homology procedure described above in combination with an in vivo cloning 
procedure using the probe oligonucleotide and one of the primer pairs in a human fetal kidney library resulted 
in the idermfication of the mil-length DNA sequence DNA45410 (Fig. 29. SEQ ID NO:71) and the derived 
PR0361 native sequence protein UNQ316 (Fig. 30. SEQ ID NO:72). 

The forward and reverse PCR primers and a hybridization probe were synthesized for use in the 
above-described method: 
forward PCR primer (.fl): 

5- AGGG AGG ATTATCCTTG ACCTTTG AAG ACC-3* (SE Q ID NO:73) 

forward PCR primer (.Q): 5* -GAAGCAAGTGCCCAGCTC-3' (SEQ ID NO:74) 

forward PCR primer (.0): 5* ^GGGTCCCTGCTCTTTGG-3' (SEQ ID NO:75) 

reverse PCR primer <.rl ): 5 f -CACCGTAGCTGGGAGCGCACTCAC-3* (SEQ ID NO:76) 

reverse PCR primer (t2): 5'-AGTCTAAGTCAAGCTCCC-3' (SEQ ID NO:77) 

hybridization probe : 

5*- GCTTCCTG AC ACT AACGCTGTCTGCTAGTCAGAATTGCCTCAAAAAGAG-3' (SEQ ID NO:78) 
Clone DNA45410 (SEQ ID NO:7!) contains a single open leading fame with an apparent translation 
initiation site at nucleotide positions 226-228 and ending at the stop codon (TAA) at nucleotide positions 1519- 
1S2I (Figure 29), as indicated by bolded underline. The predicted PR036I polypeptide precursor (U.. 
UNQ316. SEQ ID NO:72) is 431 amino acid* long (Figure 30). The native sequence PR036I protein shown 
in Figure 30 as UNQ316 has an estimated molecular weight of about 46810 and a pi of about 6.45. In addition, 
regions indicative of the aiginase family proteins are present at about residues F3 to VI 4 and again at D9 to 
TS7, while a transmembrane domain exists at about residues P380 to S409. A cDNA clone containing 
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DNA45410 (SEQ ID NO:7l) has been deposited with ATCC on February 5. 1998 and is assigned ATCC 
deposit no. 209621. 



10 



25 



P - Isolation of c DNA clones Encoding Human PRQ362 (UNQ3171 

Use of the ECD homology procedure described above m a human fetal brain Ubrary resulted m the 
isolation of the full-length DNA sequence DNA45416 (Fig. 31, SEQ ID NO:79) and the derived PR0362 
native sequence protein UNQ317 (Fig. 32. SEQ ID NO;80). 

The PGR primers (forward and reverse) and hybridization probe synthesized for use in the above 
procedure were: 

forward PCR primer I : 5^TATCCCTCCAATTGAGCACCCTGG-3' (SEQ ID NO:8l) 

forward PCR primer 2 : 5'-GTCGGAAGACATCCCAACAAG-3* (SEQIDNO:82) 

15 reverse PCR primer 1 : 5"-CTTCAC AATGTCGCTGTGCTGCTC- 3' <SEQIDNO:83) 

reverse PCR primer 2 : 5-AGCCAAATCCAGCAGCTGGCTTAC-3' (SEOIDNO:84) 

hybridization probe : 

5--TGGATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCTCAT-3- (SEQ ID NO:85) 

Clone DNA45416 (SEQ ID NO:79) contains a single open reading frame with an apparent translation 
initiation site at noeleotide positions 119-121 and ending at the stop codon (TAA) at nucleotide positions 1082- 
1084 (Figure 31). as indicated by bolded underline. The predicted PR0362 polypeptide precursor (**.. 
UNQ317. SEQ ID NO:80) is 321 amino acids long (Figure 32). Tie UNQ317 protein (SEQ ID NO:80) shown 
in Figure 32 has an estimated molecular weight of about 35.544 daltons and a pi of about 8.51. Analysis of the 
UNQ3I7 polypeptide as shown in Figure 32 evidences the presence of a giycosaminoglycan attachment site at 
about amino acid 149 to about amino acid 152 and a transmembrane domain from about amino acid 276 to 
about amino acid 306. A cDNA clone containing DNA45416(SEQ ID NO:79) has been deposited with ATCC 
on February 5. 1 998 and is assigned ATCC deposit no. 209620. 
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0 Isolation ofcDNA clones Enc oding Human PRQ363 fUNQIlfi) 

Use of the ECD homology described above in a human fetal kidney library resulted in the isolation of 
«ne falMength DNA sequence DNA45419 (Fig. 33. SEQ ID NO:86) and the derived PR0363 native , 
protein UNQ318 (Fig. 34. SEQ ID NO:87). 

The PCR primers (forward and reverse) and hybridization probe synthesized for use in the above 
pr o ce dur e were 

forward PCR primer: 

5'-CCAGTGCACAGCAGGCAACGAAGC-3' (SEQ m ^ 

re verse PCR prune: 

5--ACTAGGCTGTATGCCTGGGTGGGC-3' (SEQIDNO:89) 
hybridization probe: 

r-GTATCTAC AAAGCATCG G CATGGTTG C AGG AGCAGTGACAGGC -3' (SEQIDNO:90) 
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Clone DNA45419 (SEQ ID NO:86) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 190-192 and ending at the stop codon (TGA) at nucleotide positions 1309- 
1311 (Figure 33), as indicated by boJded underline. The predicted PR0363 polypeptide precursor {Lc. 
UNQ318, SEQ ID NO:87) is 373 amino acids long (Figure 34). The UNQ318 protein (SEQ ID NO:87) shown 
5 in Figure 34 has an estimated molecular weight of about 41,281 daltons and a pi of about 8.33. Analysis of the 
UNQ318 polypeptide as shown in Figure 34 evidences the presence of a transmembrane domain at about 
amino acid residue 221 to about residue 254. A cDNA clone containing DNA454I9 (SEQ ID NO:86) has been 
deposited with ATCC on February 5, 1998 and is assigned ATCC deposit no. 209616. 

10 R. Isolation of cDNA clones Encoding Human PRQ364 (UNQ319) 

Use of the ECD homology procedure described above in a human small intestine library resulted in 
the identification of an expressed sequence tag (EST) (Incyte EST No. 3003460) that encoded a polypeptide 
which showed homology to members of the rumor necrosis factor receptor (TNFR) family of polypeptides. 

A consensus DNA sequence was then assembled relative to the Incyte 3003460 EST in a manner 
15 similar to that used in the ECD homology procedure which resulted in the isolauon of the full-length DNA 
sequence DNA47365 (Fig. 35. SEQ ID NO:9l) and the derived PR0364 native sequence protein UNQ319 
(Fig. 36, SEQIDNO:92). 

The PCR primers (forward and reverse) and hybridization probes synthesized for use in the above- 
described screening procedure were: 
20 forward PCR primer (44825.fl): 5'-CACAGCACGCGGCGATGGG-3' (SEQIDNO:93) 
forward PCR primer (44825.CV 5*-GCTCTGCGTTCTGCTCTG-3* (SEQIDNO:94) 
forward PCR primer (44825.GITR.f) : 

S'-GGCACAGCACGGGGCGATGGGCGCGTTT-T (SEQ ID NO:95) 

reverse PCR primer (44825.rl ): 5 t -CTGGTCACTGCCACCTTCCTGCAC.3 , (SEQ ID NO:96) 

25 reverse PCR primer (44B25.r2): 5*-CGCTGACCCAGGCTGAG-3' (SEQIDNO:97) 
reverse PCR primer f44825,GITR.r) : 

5'-GAAGGTCCCCGAGGCACAGTCGATACA-T (SEQ ID NO:98) 
hybridization probe (44825.pl ) : 

5 , -GAGGAGTGCTGTTCCGAGTGGGACTGCATGTGTGTCCAGC-3 , (SEQ ID NO:99) 
30 hybridization probe (44825.GITR.P) : 

5'.AGCCTGGGTCAGCGCCCCACCGGGGGTCCCGGGTGCGGCC3' (SEQ ID NO:100) 

Clone DNA47365 (SEQ ID NO:9I) contains a single open reading frame with an apparent translation 
initiation site at nucleotide positions 121-123 and ending at the stop codon (TGA) at nucleotide positions 844- 
846 (Figure 35). as indicated by bolded underline. The predicted PR0364 polypeptide precursor (ie. 
UNQ3 19, SEQ ID NO:92) is 241 amino acids long (Figure 36). The UNQ3 19 (SEQ ID NO:92) protein shown 
in Figure 36 has an estimated molecular weight of about 26,000 daltons and a pi of about 6.34. A potential N- 
grycosyianon sites exists between amino acids 146 and 149 of the amino acid sequence shown m Figure 36. A 
putative signal sequence is from amino acids 1 to 25 and a potential transmembrane domain exists between 
acids 162 to 180 of the sequence shown in Figure 36. A cDNA clone containing DNA47365 
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deposited with ATCC on November 7. 1997 and is assigned ATCC 
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Isolation ofcDNA clones Encoding Human PRQ356 qjN0313)fNL4) 
An expressed sequence tag (EST) DNA database (UFESEQ* Incyte Pharmaceuticals, Palo AJto CA) 
wa, searched and an EST (#2939340) was identified which showed homology to human TE-2 LI and TTE-2 
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Based on the EST, a pair of PCR primers (forward and reverse), and a probe were synthesized: 
NL4.5-1: 5'-TTCAGCACCAAGGACAAGGACAATGACAACr-r (SEQ ID NO 103) 

NU^i: y-TGTGCACACTTGTCCAAGCAGTTCTCATTGTCO' (SEQ ID NO' 104) 

NL4.3-3: S'-GTAGTACACTCCATTGAGGTrGGO' (SEQ W N0: J0J) . 

Oligo dT primed cDNA libraries were prepared from uterus mRNA purchased from Clontech, Inc. 
(Palo Alto. CA, USA, catalog U 6537-1) in the vector pRK5D using reagents and protocols from Life 
Technolog.es, Gaithersburg, MD (Super Script Plasmid System). pRK5D is a cloning vector mar has an sp6 
transcription initiation site followed by an SGI restriction enzyme site preceding the XhoI/NotI cDNA cloning 
sues. The cDNA was primed with oligo dT containing a Not! site, linked with blunt to Sail hemikinased 
adaptors, cleaved with Noti. sized to greater than 1 000 bp appropriately by gel electrophoresis, and cloned in a 
defined orientation into XhoI/Notl-cleaved pRKSD. 

In order to screen several libraries for a soume of a full-length clone. DNA from the libraries wa* 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
bobte clones encoding the PRD356 gene using the I»rabe oligonneleotide and of Ae PCR primers. 

DNA sequencing of the clones isolated as described above gave a full-length DNA sequence encoding 
.he native sequence PR0356 (NL4) (/.,.. DNA47470. SEQ ID NO:10.) and the derived PR0356 protein 
sequence UNQ3 1 3 (SEQ ID NO: 1 02). 

Tne entire nucleotide sequence of DNA47470 is shown in Figure 37 (SEQ ID NO: 101) Clone 
DNA47470 (SEQ ID NO:»01) coraaras a single open reading frame w„h an apparent translation initiation site 
at nucleotide positions 2.5-217, and a TAA stop codon at nucleotide positions .038-1040. as indicated by 
boided underlie. The predicted PR0356 polypeptide is 346 amino acids long (U.. UNQ313 (SEQ ID 
Nai02). has a calculated molecular weigh, of 40.0.8 dahons and a pi of 8.,9. A cDNA clone containing 
DNA47470 (SEQ ID NO:,0.) has been deposued with ATCC on October 28. .997 and is assigned ATCC 
^—^i no. 209422. 



T " Isolation ofcDNA clones Enenrfing Ham™ p»r>< ;31 (UNQ337) 

Use of me ECD homology procedure identified above in a human fetal brain library resulted in the 
isolation of the mU-length DNA sequence DNA483.4 (Fig. 39. SEQ ID NO: 106) and the derived PR053. 
native sequence protein UNQ332 (Fig. 40, SEQ ID NO: 107) . 

Tfeiraprimerafforwartarrirevem) 
forward PCR primer S*-CTXjAGAACGCGCCTGAAACTCTO-3' (SEQ ID NO 108) 

reverse PCR primer 5*-AOCGTTCTCATTGACATCOGCG-y (SEQ ID NO: 1 09) 
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hybridization probe : (SEQ ED NO: 1 10) 

5*-TTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA-3* 

Clone DNA48314 (SEQ ID NO:106) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 171-173 and ending at the stop codon (TGA) at nucleotide 
positions 2565-2567 (Figure 39), as indicated by bolded underline. The predicted PR0531 polypeptide 
precursor (lc, UNQ332, SEQ ID NO:107) is 789 amino acids long. Tne UNQ332 protein (SEQ ID NO: 107) 
shown in Figure 39 has an estimated molecular weight of about 87552 daltons and a pi of about 4.84. A clone 
containing DNA483 14 (SEQ ID NO: 106) has been deposited with the ATCC on 26 March 1998, and has been 
assigned deposit number 209702. 

Analysis of the UNQ332 amino acid sequence of SEQ ID NO: 107 reveals a cadherin extracellular 
repeated domain signature at about amino acids 122-132, 231-241, 336-346. 439-449 and 549-559. An 
ATP/GTP-binding site motif A (P-loop) is found at about amino acids 285-292 of SEQ ID NO: 107. N- 
glycosylation sites arc found at least at about amino acids 567-570, 786-790, 4 18-421 and 336-339. the signal 
peptide is at about amino acids 1-26, and the transmembrane domain is at about amino acids 685-712 of SEQ 
ID NO: 107. 

u - Isolation of cDNA clones Encoding Human PRQ533 fUNQ334> 

The EST sequence accession number AF007268, a murine fibroblast growth factor (FGF-15) was 
used to search various public EST databases [e.g., GenBank. Dayhor£ eic). The search was performed using 
the computer program BLAST or BLAST2 [Altschul et ai. Methods in Ensymology. 266:460-480 (1 996)] as a 
comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. The search resulted 
in the identification of GenBank EST AA220994, which has been identified as stratagenc NT2 neuronal 
precursor 937230. 

Based on this sequence, oligonucleotides were synthesized: 1 ) to identify by PCR a cDNA library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence. In order to screen several libraries tor a source of a full-length clone. DNA from the libraries was 
screened by PCR amplification, as per AusuM et aL Current Protocols in Molecular Biology, with the PCR 
primer pair. A positive library was then used to isolate clones encoding the PR0533 gene of interest by an in 
vivo cloning procedure using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal retina. The cDNA 
Ubraries used to isolated the cDNA clones were constructed by standard methods using commercially available 
reagents (e.g.. Invitrogen, San Diego, CA; Clontech, cfc.) The cDNA was primed with oligo dT containing a 
NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; Holmes et aL, Science. 253: 1278-1280 
(1991 )) in the unique Xhol and NotI sites. 

A cDNA clone was sequenced in its entirety. The full length nucleotide sequence DNA49435 (SEQ 
ID NO: 1 1 1) is shown in Figure 4 1 . Clone DNA49435 (SEQ ID NO: 1 1 1) contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 46*466 and ending at the stop codon (TAA) 
at nucleotide positions 649-651, as indicated by bolded underline in Fig. 41. The predicted PR0533 
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polypeptide precursor (ie.. UNQ334. SEQ ID NOH2) is 216 amino acids long, has a calculated molecular 
weight of 24.003 daltons and a pi of 6.99. Clone DNA49435-1219 has been deposited with ATCC (under me 
designation DNA49435- 1219) on November 2 1. 1997 and is assigned ATCC deposit no. 209480. 

The oligonucleotide sequences used in the above procedure were the following: 
FGFI5.fi 5--ATCCGCCCAGATC3GCTACAATGTGTA-3' (SEQ ID NO:! 13) 

FGFI5.p: 5'-GCCTCCCGGTCTCCCTGAGCAGTGCCAAACAGCGGCAGTGTA-3' (SEQ ID NO: 1 14) 

FGF15.T 5--CCAGTCCGGTGACAAGCCCAAA-3- (SEQIDN01I5) 
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V - Isolarion of cDNA clones E ncoding Human PROIQg3 OJNQ54fl) 

Use of the amylase yeast screen procedure described above on tissue isolated from human fetal 
kidney tissue resulted in an EST sequence which served as the template for the creation of the 
oligonucleotides below and screening as described above in a human fetal kidney library resulted in the 
isolation of the full length DNA sequence DNA50921 (Fig. 43, SEQ ID NO: 1 16) and the derived PRO1083 
native sequence protein UNQ540 (SEQ ID NO: 117). 

The PCR primers (forward and reverse) and hybridization probes synthesized for use in the above 
procedure were the following: 

forward primer (43422.fl): 5-GGCATTGGAGCAGTGCTGGGTG-3' (SEQ ID NO:l 18) 

20 forward primer (43422J2): 5'-AGAGCAACTCAGACAGCG-3' (SE Q ID NO: 1 19) 

reverse primer (43422^1): y-TGGAGGCCTAGATGCGGCTGGACG-r (SEQ ID NO:120) 

reverse primer (43422^2): 5--CGAGGAGACCATCAGCAC-3' (SE Q ID N0121) 

hybridization probe: (4342Zpl): (SEQIDNO:122) 
5'-CCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGC-3' 

Clone DNA50921 (SEQ ID NO: 1 16) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 154-156 and ending a, the stop codon (TAG) at nucleotide 
positions 2233-2235 (Figure 43). as mdicated by bolded underline. The predicted PROI083 polypeptide 
precursor (/.*. UNQ540. SEQ ID NO:117. Figure 44, is 693 amino acids long. The UNQ540 (SEQ ID 
NO:ll7) protein shown in Figure 44 has an estimated molecular weight of about 77738 and a pi of about 
8.87. A clone containing DNA5092I has been deposited with the ATCC on May 12. 1998 and has been 
assigned deposit number 209859. 

Analysis of the amino acid sequence UNQ540 (SEQ ID NO: 1 17) reveals the putative signal peptide is 
at about amino acids 1 -25. transmembrane domains are at about amino acids 382-398. 402-420. 445-468. 473- 
491, 519-537. 568-590 and 634-657. a nucrobodies C-terminal targeting signal a, about amino acids 69.-693. 
CAMP- and cGMP-dependent protein kinase phosphorylation sites at about amino acids 198-201 and 370-373. 
N-glycosyiation sites at about amino acids 39-42, 148-151, 171-174. 234-237. 303-306. 324-227 and 341-344 
and a G-protein coupled receptor family domain at about amino acids 475-504. 

W - Isolation ofcDNA clone. Enro ling Human PBQg65 OJNQ4341 
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Use of the amylase yeast screen procedure described above on tissue isolated from human fetal kidney 
tissue resulted in an EST sequence which served as the template for the creation of the oligonucleotides below 
and screening as described above in a human fetal kidney library resulted in the isolation of the full length 
DNA sequence DNA53974 (Fig. 45, SEQ ID NO: 123) and the derived PR0865 native sequence protein 
UNQ434 (SEQ ID NO: 124). 

The PCR primers (forward and reverse) and hybridization probes synthesized for use in the above 
procedure were the following: 

forward primer (48615.fl ): S'-AAGCTGCCGGAGCTGCAATG-r (SEQ ID NO: 125) 



5 , -AGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGAT-3 , 

Clone DNA53974 (SEQ ID NO: 123) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 173.175 and ending at the stop codon (TAA) at nucleotide 
posiuoa* 1577-1579 (Figure 45). as indicated by bolded underline. The predicted PROB65 polypeptide 
precursor (/.&. UNQR65, SEQ ID NO: 124) is 468 amino acids long. The UNQ434 (SEQ ID NO: 124) protein 
shown in Figure 46 has an estimated molecular weight of about 54,393 and a pi of about 5.63. A clone 
containing DNA53974 (SEQ ID NO: 123) has been deposited with the ATCC on April 14, 1998 and has been 
assigned deposit number 209774. 

Analysis of the amino acid sequence UNQ434 (SEQ ID NO: 124) reveals the putative signal.pepude at 
about ammo acid residues 1-23, potential N-glycosylarion sites at about amino acids residue 280 and at about 
384. a potential amidation site from about amino acid residue 94 to about residue 97. giycosaminoglycan 
attachment sites from about amino acid residue 20 to about 23 and from about residue 223 to about residue 
226. an aminotransferase class-V pyridoxyl-phosphate amino acid sequence block from about amino acid 
residue 216 to about residue 222 and an amino acid sequence block similar to that found in the interleukin-7 
protein from about amino acid residue 338 to about residue 343. 

X. Isolation of cDNA clones Encoding Human PRO770 (UNQ408) 

A public expressed sequence tag (EST) DNA database (Mercl^ashington University) was searched 
with the full-length murine m-FIZZi DNA (DNA53517), and an EST, designated AA524300 was identified, 
which showed homology with the m-FIZZl DNA. 

The foil-length clone corresponding to the EST AA524300 was purchased from Incyte (Incyte 
Pharmaceuticals, Palo Alto, CA) and sequenced in entirety. 

The entire nucleotide sequence of the resulting PROT7f>*ncoding full-length clone is shown in 
Figure 47. This foil-length clone, designated DNA54228 (SEQ ID NO: 133), contains a single open reading 
frame with an apparent translation initiation site at nucleotide positions 100-102 (Fig.47; SEQ ID NO:133) and 
ending at the stop codon (TGA) at residues 433-435, as indicated by bolded underline. The predicted PRO770 
polypeptide precursor (including a putative signal sequence of 20 amino acids) (Le., UNQ408, SEQ ID 



forward primer: (48615J2): 5'-TTGCTTCTTAATCCTGAGCGC-3' 
forward primer (48615J3): S'-AAAGGAGGACTTTCGACTGCO' 
reverse primer. (48615.M): 5^AGAGATTCATCCACTGCTCCAAGTCG-3* 
reverse primer (48615.r2): 5 , -TGTCCAGAAACAGGCACATATCAGC-3* 
hybridization probe: (43422.p 1 ): 



(SEQ ID NO: 126) 
(SEQ ID NO: 127) 
(SEQ ID NO: 1 28) 
(SEQ ID NO: 129) 
(SEQ ID NO: 130) 
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NOtKM) is 1 1 1 amino acids long, ha* a calculated molecular weight of 11.730 daltons and a pi of 7.82. Based 
upon its homology to m-FIZZl (50%, using the ALIGN software), the protein is believed to be die human 
homolog of m-FIZZl. and has been designated h-FIZZl. A cDNA clone containing DNA54228 (SEQ ID 
NO:133) has been deposited with ATCC and is assigned ATCC deposit no. 209801. 
Identification and cloning of m.R77i (QNA53517) 

Mouse asthma model Female Balb/C mice. 6 to 8 weeks of age, were separated into two 
experimental groups: controls and asthmatics. The asthmatic group was immunized intraperitoneally with 10 
ug ovalbumin - 1 mg alum, while the control group was not. Two weeks later, mice were exposed daily to an 
aerosol of 10 mg/mi ovalbumin in PBS aerosolized with a UltraNeb nebulizer (DeVilbiss) at the raw of 2 
ml/min for 30 min each day. for 7 consecutive days. One day after the last aerosol challenge, whole blood, 
serum and bronchoalveolar lavage (BAL) samples were collected and the lungs were harvested and preserved 
for histological examination, immuno-histochemistry and in situ hybridization. 

OH electrophoresis of BAL snmptoExaminadon of the BAL samples by gel electrophoresis on a 16% 
Tricme gel shows mat a low molecular weight protein is expressed in the BAL samples from asthmatic mice 
but not in the BAL samples from control mice. Thfc low molecular weight protein was termed m-FIZZl and 
was seen to co-migrate with a 8300 Dalton marker protein. 

Partial protein sequence The protein of interest was transferred upon a PVDF membrane and 
sequenced by Edman degradauon. This sequence served as a template for die preparation ofvanous cloning 
oligos as described below. 

Partial cDNA sequence We designed two degenerate oligonucleotide PCR primers corresponding 
to the putative DNA sequence for the first 7 and the last 7 amino acids of the partial protein sequence.. 
Ohgo#l: 

5--ACA AAC GCC TGA VGA RAC NAT HGA RAT-3' - (SE0 ID NO: 133) 

Oligo n 2: 

25 5--TGG TGC ATG CGG RTA RTT NGC NGG RTT-3' (SE q m NO: , 36) 

cDNA prepared from the lungs of normal mice was used as a template for the PCR reaction which 
yielded an 88 bp product. This 88 bp product contained 54 known base pairs, encoding the PCR primers, and 
34 novel base pairs, and encoded another partial mFIZZ- 1 sequence. 

Full length cDNA clone This second partial sequence was used to design primers which were 
ulumatdy successful in obtaining the full length FIZZ clone (DNA535 1 7) by RT-PCR of mouse lung pol^A)' 
RNA. 
Oligo #3: 

5--ACA AAC GCG TGC TGG AGA ATA AGG TCA AGG-3' (SEQ n> NO: l37) 

This oligo was used as an RT-PCR primer in combination with 5" and 3' ampiimers from Ckmtech. 
35 Oligo #4: 

* ACT AAC GCG TAG GCT AAG GAA CTT CTT GCC-3' (SEQ id NO: 138) 

This oligo was used as an RT-PCR primer in enmbmaritm with oligo d(T>. 
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A public expressed sequence tag (EST) DNA databases (Merck/Washington University) was searched 
with the full-length murine ro-FIZZl DNA (DNA 53517) described above and the EST W42069 was 
identified. 

The full-length clones corresponding to the EST fragment W420G9 was obtained bom Incyie 
5 Pharmaceuticals (Palo Alto. California), and sequenced in the entirety, which ultimately resulted in the 
identification of the fall length nucleotide sequence DNA54231 (SEQ ID NO:139). 

The nucleotide sequence corresponding to the fall length, native sequence PR0769 clone is shown is 
Figure 49. This clone, designated DNA 5423 1 (SEQ ID NO: 139) contains a single open reading frame with an 
apparent translation initiation site at nucleotide positions 75-77 and ending at the stop codon (TGA) at residues 
10 41 7-419. as indicated by boWed underline (Fig. 49). The predicted PR0769 polypeptide precursor (including a 
signal sequence of 10 amino acidsX<>.. UNQ407, SEQ ID NO.140) is 1 14 amino acids long, has a calculated 
molecular weight of 12,492 daltons and a pi of 8.19. Based on its homology to m-FIZZl (34%, using the 
ALIGN software) the protein was designated m-FIZZ3. A clone containing DNA54231 (designated 
DNA5423 1-1366) has been deposited with ATCC on April 23, 1998 and has been assigned ATCC deposit no. 
15 209802. 

2- Isolation of cDNA clones Encoding Human PRQ7RR (UNQ430) 

Use of the ECD homology procedure identified above resulted m the identification of the partial 
length EST sequence 2777282. Further analysis of the corresponding full-length sequence resulted in the 
20 identification of DNA56405 (SEQ ID NO:!4l) and the derived native sequence PR0788 protein UNQ430 
(SEQ ID NO: 142). 

Clone DNA56405 (SEQ ID NO: 14 1) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 84-86 and ending at the stop codon (TAG) at nucleotide 
positions 459-461 (Figure 51). as indicated by bolded underline. The predicted native sequence PR0788 

25 polypeptide precursor (i.e. UNQ430, SEQ ID NO: 142) is 125 ammo acids long (Figure 52). has a calculated 
molecular weight of 13.1 15 daltons and a pi of 5.90. The UNQ430 (SEQ ID NO: 142) protein shown in Figure 
52 has an estimated molecular weight of about 13115 and a pi of about 5.90. A clone containing DNA56405 
(SEQ ID NO: 142) has been deposited with the ATCC on May 6, 1998 and has been assigned deposit 
number209849. In the event of a discrepancy in the nucleotide sequence of the deposit and the sequences 

30 disclosed herein, it is understood that the deposited clone contains the correct sequence. It is farther 
understood that the methodology of sequencing for the sequences provided herein are based on known 
sequencing techniques. 

Analysis of UNQ430 (SEQ ID NO:52) shown in Figure 52 reveals a signal peptide at about amino 
acids I- 1 7 and an N-glycosylation site is at about amino acids 46. 

35 

AA. Isolation of c DNA clones En coding Huma n PRQ1 1 14 (UNQ557) 

Use of the amylase yeast screen procedure described above on tissue isolated from human fetal 
kidney tissue resulted in an EST sequence which served as the template for the creation of the 
oligonucleotides below and screening as described above in a human breast carcinoma library resulted in the 
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isolation of the full length DNA sequence DNA57033 (Fig. 53, SEQ ID N0:I43) and the derived PR01114 
native sequence protein UNQ557 (Fig. 54, SEQ ID NO.144). 

The PCR primers used in the isolation screen described in the previous paragraph were: 
forward primer (48466.fi): 5 , -ACGCTTCGCTGCGACTAGACCTC-3' (SEQIDNO : i45) 

reverae primer (48466.H): 5--CCAGGTCGGGTAAGGATGGTTGAG-3' (SEQIDNO:I46) 

hybridization probe: 48466.pl): 

5--T1TCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC-3- (SEQ ID NO I47) 

Clone DNA57033 (SEQ ID NO: 143) contains a single open reading frame with . apparent 
translation initiation site at nucleotide positions 250-252 and ending at the stop codon (TAG) found at 
nueleotidepositions • '83-1185 (Figure 53. SEQ ID N0:I43). as indicated by bolded underline. Thepredicted 
PROI 1 14 polypeptide precursor (Lc. UNQ557. SEQ ID NO:144) is 31 1 amino acids long, has a calculated 
molecular weight of approximately 35.076 daltons and an estimated pi of approximately 5.04. Analysis of the 
foil-length PROI 1 14 sequence shown i„ Figure 54 (SEQ ID NO: 144) evidences the presence of the following: 
a signal peptide from about amino acid I to about amino acid 29. a transmembrane domain from about amino 
acid 230 to about amino acid 255. potential N- g lycosyfation sites from about amino acid 40 to about amino 
acid 43 and from about amino acid 134 ,o about amino acid 137. an amino acid sequence block having 
homology to tissue factor proteins from about ammo acid 92 to about amino acid 119 and an amino acid 
sequence block having homology to integrin alpha chain proteins from about amino acid 232 to about amino 
acid 262. A cDNA clone containing DNA57033 (SEQ ID NO:l43) has been deposited with ATCC on May 
20 27. 1998 and is assigned ATCC deposit no. 209905. 

AB - Isolation ofcDNA clones Enc oding Human PRQIQQ7 OrNQ401) 

Use of the ECD homology procedure described above resulted in the identification of an EST 
sequence designated Merck EST T70513. which was derived from human liver tissue (clone 83012 from 
hbrary 341) was farther examined. The corresponding full-length clone was further examined and sequenced, 
resulting in the isolation of the full-length DNA sequence DNA57690 (Fig. 55. SEQ ID NO: 145) and the 
derived PRO 1 007 native sequence protein UNQ491 (Fig. 56. SEQ ID NO- 146). 

Clone DNA57690 (SEQ ID NOUS) contains a single open reading frame with an apparent 
translation initiation site a, nucleotide positions .6-18 and ending at the step codon (TO A) a, nucleotide 
posmons 1054-1056 (Figure 55), as indicated by bolded underline. The predicted PRO1007 polypeptide 
precursor .</.*, UNQ491. SEQ ID NO:146) is 346 amino acids long (Figure 56). has a calculated molecular 
weight of 35.971 daltons and a pi of 8.17. The UNQ491 (SEQ ID NO: 1 46) protein shown in Figure 56 has an 
esnmated molecular weigh, of about 3597. daltons and a pi of about 8.17. A cDNA clone containing 
DNA57690 (SEQ ID NO: 145) has been deposited with the ATCC on 9 June 1998. and has been assigned 
35 deposit number 209950. 

Analysis of the amino acid sequence of UNQ491 (SEQ ID NO: 1 46) reveals the putative signal peptide 
a, about anuno acid residues 1-30. a transmembrane domain « about amino acid residues 325-346 N- 
glycosylation sites at about amino acid residues 118. 129. 163. 176. 183 and 227 and a Ly-6/u-Par domain 

proteins at about amino acid residues 17-36 and ?oo -yr> -n ... 

™ The corresponding nucleotides of the amino acids 

presented herein can be routinely determined given the sequences provided herein. 
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AC. Isolation of cDNA clones Encoding Human PR0 11 84 (UN0598) 

Use of the signal algorithm procedure described above resulted in the identification of Incyte EST 
1428374 which was derived from an ileum tissue library (39, SINTBST01 ). Further examination of the ftiU- 
length clone corresponding to this sequence resulted in the isolation of the full-length DNA59220 (Fig. 57, 
SEQ ID NO: 147) and the derived PROl 184 native sequence protein UNQ598 (Fig. 58, SEQ ID NO:148). 

UNQ598 (SEQ ID NO: 148), as shown in Figure 58 exhibits an apparent translation initiation site at 
nucleotide positions 106-108 and ending at the stop codon (TGA) found at nucleotide positions 532-534, as 
indicated by boJded underline. The predicted PR01184 polypeptide precursor (Lc. UNQ598, SEQ ID 
NO:148) is 142 amino acids long, has a calculated molecular weight of approximately 15690 daltons and an 
estimated pi of arjproximately 9.64. Analysis of UNQ598 (SEQ ID NO:148) evidences the presence of a signal 
peptide at about amino acids 1-38. A cDNA clone containing DNA59220 (SEQ ID NO: 147) has been 
deposited with the ATCC on 9 June 1998, and has been assigned deposit number 209962. It is understood that 
the deposited clone has the actual sequences and that representations are presented herein. 

AD. Isolation of cDNA clones Encoding Human PRO 1031 (UNQ516) 

Use of the ECD homology procedure described above resulted in the identification of the EST 
sequence Merck W74558 (clone 344649). The corresponding rull-lcngth clone was examined and sequenced 
resulting in the isolation of DNA sequencing gave the rull-length DNA sequence DNA59294 (Fig. 59. SEQ ID 
NO:l49) and the derived PROI03I native sequence protein UNQ516 (Fig. 60, SEQ ID NO:150). 

Clone DNA59294 (SEQ ID NO: 149) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 42-44 and ending at the stop codon (TGA) at nucleotide 
positions 582-584 (Figure 59), as indicated by bolded underline. The predicted PRO103I polypeptide 
precursor (re., UNQ516, SEQ ID NO:150) is 180 amino acids long (Figure 60). The UNQ516 protein shown 
in Figure 60 has an estimated molecular weight of about 20437 and a pi of about 9.58. Clone DNA59294 
(SEQ ID NO: 149) has been deposited with the ATCC on May 14. 1998 and has been assigned deposit number 
209866. Regarding the sequence, it is understood that the deposited clone contains the correct sequence, and 
the sequences provided herein arc based on known sequencing techniques. 

Analysis of the amino acid sequence of UNQ516 (SEQ ID NO: 150) reveals the putative signal peptide 
at about amino acid residues 1-20, an N-giycosylation site is at about amino acid residue 75. A region having 
sequence identity with IL-17 is at about amino acid residues 96-180. The corresponding nucleotides can be 
routinely determined given the sequences provided herein. 

AE. Isolation of cDNA clones E ncoding Hiim an PRQ1346 (UNO701) 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
isolation of the full-length DNA sequence DNA59776 (Fig. 61, SEQ ID NO:151) and the derived PR01346 
native sequence protein UNQ701 (Fig. 62, SEQ ED NO: 152). 

The PCR primers (forward and reverse) and hybridization probe used in the isolation of DNA59776 
(SEQ ID NO: 151) were the following: 

forward PCR primer (45668.fl): 5 , -CACACGTCCAACCTCAATGGGCAG-3 , (SEQ ID NO:153) 
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reverse PCR primer (45668.r 1 ): 5'-GACCAGCAGGGCCAAGGACAAGG-3* (SEQ ID NO: ! 54) 

hybridization P robc (45668.p 1 ): (SEQ n> NO: 155) 

5 , <nTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGTACCGCTTAG-3 f 

Clone DNA59776 (SEQ ID NO: 15 1) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 1-3 (ATG), and an apparent stop codoa (TAG) at nucleotide 
positions 1384-1386 (TAG). The predicted PR01346 polypeptide precursor (i.e.. UNQ701. SEQ ID N0:I52) 
is 461 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid 
positions from about 31 to about 50, fibrinogen beta and gamma chains C-terminal domain signature at about 
amino acid positions 409-421 and a leucine ripper patterns at about amino acid positions 140-161. 147-168, 
154-175 and 161-182. 

A cDNA clone containing DNA59776, designated as DNA59776-1600. has been deposited with 
ATCC on August 18. 1998 and is assigned ATCC deposit no. 203128. The UNQ701 (SEQ ID NO:152) 
protein shown in Figure 62 has an estimated molecular weight of about 50744 daltons and a pi of about 6.38. 

AF. Isolation of cDNA clones Encoding Human PRO! 155 (UNQ585) 

Use of the signal algorithm procedure described above resulted in the identification of Incyte EST 
2858870 which was derived from an ileum tissue library (39. SININOT03). Funher examination of the full- 
length clone corresponding to this sequence resulted in the isolation of the full-length DNA sequence 
DNA59849 (Fig. 63. SEQ ID NO:156) and the derived PROU55 native sequence protein UNQ585 (Fig. 64, 
SEQIDNO:157). 

The UNQ585 (SEQ ID NO: 1 57) polypeptide shown in Figure 64 contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 158-160 and ending at the stop codon (TAA) 
found at nucleotide positions 563-565. as indicated by bolded underline. The predicted PROl 155 polypeptide 
precursor (i.e.. UNQ585. SEQ ID N0:157) is 135 antino acids long, and signal peptide appears at about amino 
acids residues I to about 18. a leucine zipper pattern appears at about amino acid residues 43 to 64 and a 
tachykinin family signature appears at about amino acid residues 86 to about 91. UNQ585 (SEQ ID NO: 157) 
has a calculated molecular weight of approximately 14833 daltons and an estimated pi of approximately 9.78. 
A cDNA clone containing DNA59849 (SEQ ID NO:156), designated as DNA59849-I504. has been deposited 
with ATCC on June 16. 1998 and is assigned ATCC deposit no. 209986. 

AG. Isolation of c DNA clones Encoding Human PRO1250 (UNQ633) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence from the incyte database, designated Incyte EST cluster sequence no. 56523. This sequence was then 
compared to a variety of various EST databases as oescribed under the signal algorithm procedure above, and 
further resulted in the identification of Incyte EST 3371784. Further examination and sequencing of the full- 
length clone corresponding to this EST sequence resulted in the isolation of the full-length DNA sequence 
DNA60775 (Fig. 65, SEQ ID NO:l58) and the derived PROI250 native sequence protein UNQ633 (Fig. 66\ 
SEQE>NO:159). 

Clone DNA60775 (SEQ ID NO:I58) cooauns a single open reading frame with an apparent 
translation initiation site at nucleotide positions 74-76 and ending at the stop codon (TAG) at nucleotide 
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posiuons 2291-2293 (Figure 65). The predicted PRO1250 polypeptide precursor {Le.. UNQ633. SEQ ID 
NO:159) is 739 amino acids long (Figure 66). The UNQ633 (SEQ ID NO:159) protein shown in Figure 66 has 
an estimated molecular weight of about 82^63 daltons and a pi of about 7.55. Analysis of UNQ633 (SEQ ID 
NO:159) evidences the presence of the following: a type II transmembrane domain from about amino acid 
residues 61 to about 80, a putative AMP-bmding domain signature sequence from about amino acid residue 
314 to about 325. and potential N-glycosylauon sites from about amino acid residues 102 to about 105, from 
about amino acid residues 588 to about 591 and from about amino acid residues 619 to about 622. A cDNA 
clone containing DNA60775 (SEQ ID NO:158) has been deposited with the ATCC on September I. 1998 and 
is assigned ATCC deposit no. 203173. 

AH. Isolation of cDNA clones Encoding Human PRQ13I2 (UNQ678) 

An EST (DNA55773) was identified in a human fetal kidney cDNA library using a yeast screen, that 
preferentially represents the 5* ends of the primary cDNA clones. Based on the DNA55773 sequence, 
oligonucleotides were synthesized for use as probes to isolate the full-length DNA sequence DNA6I873 (Fig. 
67. SEQ ID NO: 1 60) and the derived PRO 1 3 1 2 native sequence UNQ678 (SEQ ID NO: 1 6 1 ). • 

The full length DNA61R73 clone shown in Figures 67 (SEQ ID NO: 160) contains a single open 
reading frame with an apparent translation initiation site at about nucleotide posiuons 7-9 and ending at the 
stop codon (TGA) found aT about nucleotide positions 643-645, as indicated by bolded underline. The 
predicted PR013 12 polypeptide precursor (i.e.. UNQ678, SEQ ID NO: 161) is 212 amino acids long. UNQ678 
(SEQ ID NO: 161) has a calculated molecular weight of approximately 24,024 daltons and an estimated pi of 
approximately 6.26. Other features include a signal peptide al about amino acids 1-14: a transmembrane 
domain at about amino acids 141-160. and potential N-glycosylatkm sites at about amino acids 76-79 and 93- 
96. A clone containing DNA6I873 (SEQ ID NO:160) has been deposited with the ATCC on August 18. 1998, 
under the designation DNA61 873-1312* and has been assigned deposit number 203 132. 

Al. Isolation of cDNA clones Encoding Human PROM 92 (UNO606) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
isolation of the full-length DNA sequence DNA62814 (Fig. 69, SEQ ID NO:162) and the derived PR01192 
native sequence protein UNQ606 (Fig. 70, SEQ ID NO: 163). 

The PCR primers (forward and reverse) and hybridization probe used in the isolation of DNA62814 
(SEQ ID NO: 162) were the following: 

forward PCR primer (35924.fl): S'-CCGAGGCCATCTAGAGGCCAGAGC-T (SEQ ID NO:164) 
reverse PCR primer (35924.r 1 ): 5 f -ACAGGCAGAGCCAATGGCCAGAGC-3' (SEQ ID NO: 1 65). 
hybridization probe (35924.pl): (SEQ ID NO: 166). 

S'-GAGAGGACTGCGGGAGTn-GGGACCmGTGCAGACCTGCrcATG^ 

Clone DNA628 14 (Fig. 69, SEQ ID NO: 1 62) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 121-123, and an apparent stop codon (TAA) at nucleotide 
positions 766-768, as indicated by bolded uiideriine. The predicted PR01192 polypeptide precursor (Le., 
UNQ606, SEQ ID NOM63) is 215 amino acids long. The UNQ606 (SEQ ID NO:163) polypeptide] 
shown in Figure 70 has a signal peptide at about amino acids 1-21; a transmembrane contain at about i 
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acids 153-176; potential N-giycosylatkro sites at about amino acids 39-42 and 1 18-121: and homology with 
myelin PO proteins at about amino acids 27-68 and 99-128. The UNQ606 (SEQ ID NO: 1 63) shewn m Figure 
70 has an estimated molecular weight of about 24.484 Daltoas and a pi of about 6.98. 

A cDNA clone containing DNA628I4 (SEQ ID N0.162). designated as DNA62814-1521, was 
deposited with the ATCC on August 4, 1 998, and is assigned ATCC deposit no. 203093. 

AJ. Isolation of cDNA clones Encoding Human PRQ1246 (UN 06 30) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence from the Incyte database, designated Incyte EST cluster sequence no. 56853. This sequence was then 
compared to a variety of various EST databases as described under the signal algorithm procedure above, and 
further resulted in the identification of Incyte EST 248 1345. Further examination and sequencing of the full- 
length clone corresponding to this EST sequence resulted in the isolation of the full-length DNA sequence 
DNA64g85 (Fig. 71, SEQ ID N0167) and the derived PR01246 native sequence protein UNQ630 (F.g. 72, 
SEQIDNOI68). 

Clone DNA64885 (SEQ ID NO:167) contains a single open reading frame with an apparent 
translation initiation site at nucleotide postuons 119-121 and ending at the stop codon fTGA) at nucleotide 
positions 1727-1729 (Figure 71). as indicated by bolded underline. The predicted PROI246 polypeptide 
pteewsor (Lc. UNQ630. SEQ ID NO.I68) is 536 amino acids long (Figure 72). has an estimated molecular 
weight of about 61.450 daltons and a pi of about 9.17. Analysis of UNQ630 (Fig. 72. SEQ ID NO: 168) reveals 
the following: a signal peptide from about amino acid 1 to about amino acid 15. potential N-glycosylation sites 
from about amino acid 108 to about amino acid 111, from about amino acid 1 66 to about amino acid 169. from 
about amino acid 193 to about ammo acid 196. from about amino acid 262 to about amino acid 265. from 
about amino acid 375 to about amino acid 378. from about amino acid 413 to about amino acid 416 and from 
about amino acid 498 to about amino acid SOI and amino acid sequence blocks having homology to sulfatase 
proteins from about amino acid 286 to about ammo acid 315. from about amino acid 359 to about amino acid 
369 and from about amino acid 78 to about ammo acid 97. A cDNA coiuaining DNA64885 (SEQ ID 
NOM67). designated DNA64g85-1529. has been deposited with ATCC on November^. 1998 and is assigned 
ATCC deposit no. 203457. 

AK. Isolation of cDNA clones Encoding Huma n PRO 1 283 fUNQ6S3) 

Use of me ECD homology procedure described above m a human breast tumor tissue library resulted 
in the isolation of me full-length DNA sequence DNA65404 (Jig. 73, SEQ ID NO: 169) and the derived 
PR01283 native sequence protein UNQ653 (Fig. 74, SEQ ID NO 1 70). 

The PCR primers (forward and reverse) and hybridization probes used in the isolation of DNA65404 
(SEQ ID NO: 169) were the following: 

forward PCR primer (28753.fl): 5'-GGAGATGAAGACCCTGTTCCTG-3' (SEQ ID NO 171) 

forward PCR primer (28753 Jl I): y-GGAGATGAAGACCCTGTTCCTGGGTG-3' (SEQIDNO:172) 
reverse PCR primer (28753x1): y-GTCCTCCGGAAAGTCCTTATC-3' (SEQ ID NO: 173) 

reverse PCR primer (28753x1 1): 5'-GCCTAGTCTTCGGGAACGCAGCTTC-3' (SEQ ID NO:174) 
hybridization probe (28753.pl): 



(SEQIDNO:175) 
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^^AGGGACCTGGTACCTGAAGGCCATGGTGGTCGATAAGGACnTCCGGAC-y 

hybridization probe (28753.pl i): (SEQ ID NO: 176) 

5 , -CTGTCCTTCACCCTGGAGGAGGAGGATATCACAGGGACCTGGTAC-3 , 

Clone DNA65404 (SEQ ID NO: 1 69) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 45-47 and ending at the stop codon (TAG) at nucleotide 
positions 555-557 (Figure 73), as indicated by bolded underline. The predicted PR01283 polypeptide 
precursor (i.e., UNQ653, SEQ ID NO: 170) is 170 amino acids long (Figure 74). The UNQ653 (SEQ ID 
NO: 170) protein shown in Figure 74 has an estimated molecular weight of about 19,457 daltons and a pi of 
about 9.10. Analysis of the UNQ653 (SEQ ID NO.170) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 17. A cDNA clone containing DNA65404 (SEQ ID 
NO:169), designated DNA65404-1551. has been deposited with ATCC on September ?, 1998 and is assigned 
ATCC deposit no. 203244. 

/ 

AL. Isolation of c DNA clones Encoding Human PROl 195 (UNQ608) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 32204 from the Incyte database. This sequence was then compared to a variety of various EST 
databases as described under the signal algorithm procedure above, and further resulted in the identification of 
Incyte EST352980. Further examination and sequencing of the full-length clone corresponding to this EST 
sequence resulted in the isolation of the ruU-length DNA sequence DNA65412 (Fig. 75, SEQ ID NO:l77) and 
the derived PROl 195 native sequence protein UNQ608 (Fig. 76 t SEQ ID NOJ78). 

The full length clone DNA65412 (SEQ ID NO: 177) contains a single open reading frame with an 
apparent translation initiation site at nucleotide positions 58-60 and ending at the stop codon (TAG) found at 
nucleotide positions 511-513 (Figure 75). as indicate by bolded underline. The predicted PROl 195 
polypeptide precursor Ue., UNQ608. Figure 76, SEQ ID NO: 1 78) is 151 amino acids long, has a calculated 
molecular weight of 17 227 daltons and a pi of 533. Analysis of UNQ608 (SEQ ID NO: 178) reveals a signal 
sequence at about amino acids 1-22. a calculated molecular weight of approximately 17277 daltons and an 
estimated pi of approximately 5.33. A cDNA done containing DNA65412 (SEQ ID NO: 177). designated as 
DNA654 12-1523, was deposited with the ATCC on August 4 ? 1998 and is assigned ATCC deposit no. 203094. 

AM. Isolation ofcDNA clones Encoding Human PRQ1343 (UNQ698) 

Use of the amylase yeast screen procedure described above on tissue isolated fix>m human smooth 
muscle cell tissue resulted in an EST sequence which served as the template for the creation of the 
oligonucleotides below and screening as described above in a human smooth muscle cell tissue library 
resulted in the isolation of the Mill length DNA sequence DNA66675 (Fig. 77, SEQ ID NO: 179) and the 
derived PR01343 native sequence protein UNQ698 (Fig. 78, SEQ ID NO:180). 

The oligonucleotide probes employed were as follows: 
fawan!PairjrimcT(48921.fl) 5 1 -CAATATGCATCTTGCACGTCTGG-3' (SEQ IDNO:I81) 
reverse PCR primer (48921 jl) y-AAGCTTCIXrrGCTTCCTTTCCTGC-3' (SEQ ID N0:182) 
hybridization probe (4892 1 .pi) 

y-TGACCCCATTGAGAAGGTCATTGAAGGGATCAACCGAGGGCTG-3' (SEQ ID NO:183) 
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Tbe &U length clone DNA66675 (SEQ ID NOM79) contains a single open reading fame with an 
apparent translation nritiatira site « nuc^ 

812-014 (Figure 77), as indicted by bolded underline. The predicted PR01343 polypeptide precursor (te, 
UNQ698. SEQ ID NOM80. Fig. 78) is 247 amino acids long, has a calculated molecular weight of 

5 approximately 25.335 daltons and an estimated pi of approximately 7.0. Analysis of the UNQ698 

shown in Figure 78 (SEQ ID NO: 180) evidences the presence of the following: a signal peptide from about 
amino acid I to about amino acid 25 and a homologous region to ctrcumsporozoite repeats from about amino 
acid 35 to about amino acid 225. A cDNA clone containing DNA66675 (SEQ ID NO: 1 79), designamd 
DNA6667S-1587, has been deposited with ATCC on September 22. 1998 and is assigned ATCC deposit no 
10 203282. 

Alternatively, a comparison of the yeast EST sequence isolated from the amylase screen above was 
screened against various EST databases, both public and private (e.g.. see ECD homology procedure, above) 
resulting in the identification of Incyte EST clone no. 4701 148. Further analysis and sequencing of the 
corresponding full-length clone resulted in isolation of the DNA66675 sequence (SEQ ID NO:l79) shown in 
15 Figure 77. 

AN. Isolation of cDNA clones Enco ding Human PRQ1418 (UNQ732f 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 10698 (Incyte duster 121480). This sequence was then compared to a variety of various EST 
dantbases (including those derived from a placenta tissue library) as described under the signal algorithm 
procedure above, and tunher resulted in the identification of Incyre EST 1 306026. Further examination and 
sequencmg of the full-length clone corresponding to this EST sequence resulted in the isolation of the full- 
length DNA sequence DNA68864 (Fig. 79. SEQ ID NO:I84) and the derived PR014I8 native sequence 
protein UNQ732 (Fig. 80, SEQ ID NO: 185). 

The full length clone shown in Figure 79 (DNA68864, SEQ ID N0.184) contains a single open 
reading frame with an apparent translation initiation site at nucleotide positions 138-140 and ending at the stop 
codon (TAA) found at nucleotide positions 1 188-1 190. a, indicated by bolded underline. The predicted 
PR014I8 polypeptide precursor (i.e.. UNQ732. SEQ ID NO:l85) is 350 amino acids long with a signal 
pepude at about amino acids 1-19. , calculated molecular weight of approximately 39003 daltons and an 
esomated pi of approximately 5.59. A cDNA clone containing DNA68864 (SEQ ID NO: 184). designated as 
DNA68864-1629 was deposited with the ATCC on September 22, 1998 and is assigned ATCC deposit no 
203276. 

AO. Isolation of cDNA clones Enco ding Humnn ppqi3 87 fUN0722) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 10298. This sequence was then compared to a variety of various EST databases as described under 
the signal algorithm procedure above, and further resulted in the identification of Incyte EST3507924. Further 
exarnrnatioo and sequencing of the full-length clone corresponding to this EST sequence resulted in the 
notation of the ruil-length DNA sequence DNA68872 (Fig. 81, SEQ ID NO: 186) and the derived PR01387 
native sequence protein UNQ722 (Fig. g2. SEQ ID NO:187). 
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Clone DNA68872 (SEQ ID NO: 186) contains a single open reading frame with an 
translation initiation site at nucleotide positions 76-78 and ending at the stop codon (TGA) at nucleotide 
positions 1258-1260 (Figure 81). as indicated by bolded underline. The predicted PR01387 polypeptide 
precursor (i.e.. UNQ722, SEQ ID NO:I87) is 394 amino acids long. The UNQ722 (SEQ ID N0:I87) protein 
shown in Figure 82 has an estimated molecular weight of about 44,339 daltons and a pi of about 7.10. 
UNQ722 (SEQ ID NO: 187) further contains a signal peptide from about ammo acid residues 1 to about residue 
19, a transmembrane domain from about residue 275 to about residue 296. potential N-glycosylation sites at 
about residues 76, 231, 302, 307 and 376 and amino acid sequence blocks having homology to myelin pO 
protein from about amino acid residue 210 to about residue 239 and from about amino acid residue 92 to about 
residue 121. A cDNA clone containing DNA68872, designated as DNA68872-1620. has been deposited with 
the ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160. 



AP. Isolation of cDNA clones Encoding Human PRO1410 (UNQ72B) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence 98502. This sequence was then compared to a vanery of various EST databases as described under 
the signal algorithm procedure above, and runner resulted in the identification of Incyte EST1 257046. Further 
examination and sequencing of ihe full-length clone corresponding to this EST sequence resulted in the 
isolation of the full-length DNA sequence DNA68874 (Fig. 83. SEQ ID NO: 188) and the derived PR01387 
native sequence protein UNQ728 (Fig. 84, SEQ ID NO: 189). 
20 Clone DNA68874 (SEQ ID NO: 188) contains a single open reading frame with an apparent 

translation initiation site at nucleotide positions 152-154 and ending at the stop codon (TGA) at nucleotide 
tuitions 866-868 (Figure 83), as indicated by bolded underline. The predicted PROI410 polypeptide 
precursor (i.e.. UNQ728, SEQ ID NO:189) is 238 amino acids long (Figure 84). The UNQ728 protein (SEQ 
ID NO: 1 89) shown in Figure 84 has an estimated molecular weight of about 25.262 daltons and a pi of about 
25 6.44, a signal peptide from about amino acid residue 1 to about residue 20. a transmembrane domain from 
about ammo acid residue 194 to about residue 220 and a potential N-glvcosylation site at about ammo acid 
residue 132. A clone containing DNA68874 (SEQ ID NO: 188) has been deposited with ATCC on September 
22, 1998 and is assigned ATCC deposit no. 203277. 

30 A Q- Isolation of cDNA clones Encoding Human PROI917 qrNO900) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sentience 85496. This sequence was then compared to a variety of various EST databases as described under 
the signal algorithm procedure above, and further resulted in the identification of Incyte EST3255033. This 
EST was derived from an ovarian tumor library. Further examination and sequencing of the full-length clone 
35 corresponding to this EST sequence resulted in the isolation of the full-length DNA sequence DNA76400 (Fig. 
85. SEQ ID NO: 190) and the derived PR0191 7 native sequence protein UNQ900 (Fig. 86, SEQ ID NO:l91). 

The full length clone DNA76400 (SEQ ID NO:190) shown in Figure 85 contains a single open 
reading frame with an apparent translation initiation site at nucleotide positions 6 to 9 and ending at me stop 
codon (TGA) found at nucleotide positions 1467 to 1469 as indicated by bolded underline. The predicted 
40 PR01917 polypeptide precursor {U., UNQ900, SEQ ID NO: 191) is 487 amino acids long. UNQ900(SEQID 
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NO-.191) has a calculated molecular weight of approximately 55.051 daltons and an estimated pi of 
approximately 8.14. Additional features include: a signal peptide at about amino acid residues 1-30; potential 
K-glycosylation sites at about amino acid residues 242 and 48 1 . protein kinase C phosphorylation sites at about 
amino acid residues 95-97. 182-184. and 427-429: N-iiiyiiaoylation sites at about amino acid residues 107- 
112, 113-118. 117-122. 118-123. and 128-133: and an endoplasmic reticulum targeting sequence at about 
amino acid residues 484-487. 

AR. Isolation of c DNA clones Encoding Human PRO 1868 (UNQ859) 

Use of the ECD homology procedure described above in a human fetal liver library resulted in the 
identification of EST clone no. 2994689. Further analysis and sequencing of me corresponding full-length 
clone resulted in the isolation of DNA77624 (Fig. 87. SEQ ID NO:192) and the derived PROI868 native 
sequence protein UNQ859 (Fig. 88. SEQ ID NO: 1 93 J. 

Clone DNA77624 (Fig. 88. SEQ ID NO: 193) contains a single open reading fame with an apparent 
translation initiation site at nucleotide positions 51-53 and ending at the stop codon (TGA) at nucleotide 
positions 981-983. as indicated by bolded underline. The predicted PR01868 polypeptide precursor (i.e.. 
UN0859. SEQ ID NOM93. Fi 8 . 89) is 310 amino acids long. The UNQ859 (SEQ ID NO: 1 93) protein shown 
in Figure 89 has an estimated molecular weight of about 35.020 daltons and a pi of about 7.90. a 
transmembrane domain from about amino acid residue 243 to about residue 263. potential N-grycosylation 
sites at about amino acid residues 104 and 192. a cAMP- and cGMP-dependent protein kinase phosphorylation 
site from about amino acid residues 107 to about residue 1 10. casein kinase II phosphorylation sites from about 
amino acid residues 106 to about residue 109 and from about amino acid residue 296 to about residue 299. a 
tyrosine kinase phosphorylation site from about amino acid residue 69 to about residue 77 and potential N- 
myrisrolation sites from about amino acid residue 26 to about residue 31. from about residue 215 to about 
residue 220. from about residue 226 to about residue 231. from about residue 243 to about residue 248. from 
about residue 244 to about residue 249 and from about residue 262 to about residue 267. A cDNA clone 
containing DNA77624 (SEQ ID NO: 193) has been deposited with ATCC on December 22. 1998 and is 
assigned ATCC deposit no 203553. 



AS. Isolation of cDNA clones Encoding Human PRO205 (UNQ179) 

Use of the ECD procedure above resulted in the identification of an EST sequence derived from a 
human retinal library. Additional effort to identify the full length clone using an m vitro cloning procedure 
were unable to identify another PRO205 encoding DNA sequence. 

DNA sequence encoding other polypeptide of substantial homology to the UNQI79 (SEQ ID 
NO-.229) polypeptide of Figure 90 may be found as GenBank submissions AB033089J and HSM802 147 I. 

Clone DNA30868 (SEQ ID NO:89) contains what is believed to be an incomplete open reading frame 
with an apparent translation initiation site at nucleotide positions 405-W7 at indicated by bolded underline in 
Figure 89. The predicted partial length PR01868 polypeptide precursor (Le.. UNQ179. SEQ ID NO-.229) is 
343 amino acids long, has a calculated molecular weight of 39285 daltons and a pi of 6.06. 
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Analysis of the LTNQ179 (SEQ ID NO:229) shown in Figure 90 reveals a signal peptide at about 
amino acid residues 1 to 20, an N-glycosylation site at about amino acid residues 318-322, tyrosine kinase 
phosphorylation sites at about amino acids residues 21-29 and 21 1-220. N-myristolation sites at about residues 
63-69, 83-89 and 317-323 and a prokaryotic membrane lipoprotein lipid attachment site at about residues 260- 
5 271. A cDNA clone containing DNA30868 (SEQ ID NO:228) has been deposited with the ATCC on March 2, 
2000 under the designation DNA30868-1 156 and has been assigned ATCC deposit no. 



AT. Isolation of cDNA clones Encoding Murine PRQ21 (UN02H 

The isolation of DNA36638 (Fig. 91, SEQ ID NO:230) t which encodes the native sequence PR021 

1 0 polypeptide UNQ21 (Fig. 92, SEQ ID NO:23 1 ) has been previously described in U.S.P. 5,955.420. Additional 
cloning and characterizing information can be found in Schneider et aL Cell 54 (6): 787-93 (1988) and in 
Manfiolem et aL, Mol Cell Biol. ±3 (8): 4976-85 (1993). 

Clone DNA36638 contains a single open reading frame with an apparent translation initiation she at 
nucleotide residues 168-170 and ending at the stop codon (TAG) at nucleotide residues 2187-2189 (Figure 91), 

15 as indicated by bolded underline. The predicted PR021 polypeptide precursor Uc. UNQ21. SEQ ID NO:23I) 
is 673 amino acids long, has a calculated molecular weight of 74J12 daltons and a pi of 5.45. A cDNA clone 
containing DNA36638 has been deposited with the ATCC under the designation DNA36638-1056 on 
November 12. 1997 and has been assigned ATCC deposit number 209456. 

Analysis of the ITNQ2 1 polypeptide of Figure 92 (SEQ ID NCX23 1 ) reveals a signal sequence at about 

20 amino acid residues 1-27, a transmembrane domain at about amino acid residues 619-635, N-glycosylation 
sites at about residues 417-421 and 488-492, N-myristolation sires at about amino acid residues 126-132, 135- 
141, 146-152, 173-179, 214-220, 253-259, 346-352, 374-380, 440-446. 479-485, 497-503, 517-523, 612-618, 
aspanic acid and asparagine hydroxylation sites at about amino acid residues 130-142. 168-180, 209-221 and 
248-260. a vitamin K-dependem carboxylation domain and an EGF-like domain cysteine pattern signature at 

25 about amino acid residues 139-151. 



AU. Isolation of cDNA clones Encodmc Human PRQ269 (UNQ236) 

Use of the ECD homology procedure described above in a human fetal kidney library in combination 

with an in vitro cloning procedure using the probe oligonucleotide and one of the primer pairs below resulted 
30 in the identification of the full length DNA sequence DNA38260 (Fig. 93, SEQ ID NO:232) and the derived 

PR0269 native sequence protein UNQ236 (Fig. 94, SEQ ID NO:233). 

The forward and reverse PCR primers and the hybridization probe used were the following: 

forward PCR primer ( .fl ): (SEQ ID NO:234) 

5 t -TGGAAGGAGATGCGATGCCACCTG -3' 
35 forward PCR primer (.fl): (SEQ ID NO:235) 

5 , -TGACCAGTGGGGAAGGACAG-3' 

forward PCR primer (.0): (SEQ ID NO:236) 

S'-ACAC^GCAGAGGGTGCCTTG-y 

reverse PCR primer (.rl ): (SEQ ID NO:237) 

40 r-TCAGGGACAAGTGGTGTCTCTCCCO* 
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reveree PCR primed j2): (SE q m NO:238) 

y-TCAGGGAAGGAGTGTGCAGTTCTG-J- 
hyfaridizarion probe: ^ NO:239 ) 

5 , -ACAGCTCCCGATCTCAGTTACTTGCATCGCGGACGAAATCGGCGCTCGCT-3' 

Clone DNA38260 (SEQ ID NO:232) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 314-316 and ending at the stop codon (TAG) at nucleotide 
positions 1784-1786 (Fig. 93). as indicated by bolded underline. The predicted PR0269 polypeptide precursor 
is 490 amino acids long (£«.. UNQ236. Fig. 94, SEQ ID NO:233). has a calculated molecular weight of 51.636 
daltons and a pt of 6.29. A cDNA clone containing DN A3 8260 (SEQ ID NO:232) has been deposited with 
ATCC on October 17. 1997 and is assigned ATCC deposit no. 209397. 

Analysis of the UNQ236 polypeptide of Figure 94 (SEQ ID NO:223) reveals a signal sequence at 
about amino acid residues 1-16. a transmembrane domain at about residues 399-418. N-glycosylation sites at 
about amino acid residues 189-193 and 381-385. a glycosaminoglycan attachment site at about amino acid 
residues 289-293. cAMP- and cGMP-dependent protein kinase phosphorylation sites at about amino acid 
residues 98-102 and 434-J38. N-myristolation sites about amino acid residues 30-36, 35-41.58-64. 59-65. 121- 
127. 151-157. 185-191. 209-215. 267-273. 350-356. 374-380. 453-459. 463^*69 and 477-483 and an aspanic 
acid and asparagine hydroxylation site at about amino acid residues 262-274. 

AV. Isolation of c DNA Encoding Human PRQ344 (UNQltm 

Use of the ECD homology procedure described above in a human fetal kidney library in combination 
with an in vitro cloning procedure using the probe oligonucleotide and one of the primer paira below resulted 
in the identification of the full length DNA sequence DNA40592 (Fig. 95. SEQ ID NO:240) and the derived 
PR0344 native sequence protein UNQ303 (Fig. 96. SEQ ID NO:24 1), 
The forward and reverse PCR primers and the hybridization probe used were the following: 
25 forward PCR primer (34398.fl): (SEQ ID NO:242) 

5'-TACAGGCCCAGTCAGGACCAGGGG-3- 

forward PCR primer (34398.12): (SEQ ID NO:243) 

S'-AGCCAGCCTCGCTCTCGG-y 

forward PCR primer (34398.D): (SEQ , D 

30 5* -GTCTGCGATCAGGTCTGG-3' 

gggfCTjSBg (34398,1 ): (SEQ m m . 245) 

5* -GAAAGAGGCAATGGATTCGC-3' 

rcvcrecPCR P rimcr (34398.r2) : (SEQ m 

SMjACTTACACTTGCCAGCACAGCAC-r 
35 hybridization probe (34398.pl): (SEQ ID NO.247) 

5'-GGAGCACCACCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAG-3' 

Qone DNA40592 (SEQ ID NO:240) contain, a single open reading frame with an apparent 
translation initiation site at nucleotide positions 227-229 and ending at the stop codon (TAG) at nucleotide 
positions 956-958 (Fignre 95). The predicted PR0344 polypeptide precursor (ie. UNQ303. SEQ ID NO:241) 
is 243 amino acids long (Figure 96), has a calculated molecular weight of 25,298 daltons and a pi of 6.44. 
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Analysis of the UNQ303 polypeptide of Figure 96 (SEQ ID NO:241) reveals a signal peptide at about amino 
acid residue 1-15, N-myristolation sites at about amino acid residues 1 1-17. 68-74, and 216-222 and a cell 
attachment site at about amino acid residues 77-80. A cDNA clone containing DNA40592 (SEQ ID NO240) 
has been deposited with ATCC on November 2 1, 1997 and is assigned ATCC deposit no. 209492. 

5 

AX. Isolation of cDNA clones Encodine Human PRQ333 CUNQ294) 

Use of the ECD homology procedure in combination with an in vivo cloning procedure resulted in the 
identification of the partial length sequence DNA4 1 374 (SEQ ID NO:248. Figure 97). 

Clone DNA41374 (SEQ ID NO:248) contains an incomplete open reading frame with an apparent 
10 translation termination site (i.e.. stop codon, TGA) at nucleotide residues 1 1 85- i 187. as indicated in bolded 
underline. The predicted partial length PR0333 polypeptide (i.c. UNQ294, SEQ ID NO:249) is 394 amino 
acids long, a calculate molecular weight of 43.725 daltons and a pi of 8.36. 

Analysis of the UNQ294 (SEQ ID NO:249) polypeptide of Figure 98 reveals a signal sequence at 
about amino acid residues 1-14, a transmembrane domain at about residues 359-376, N-myristoyiation sites at 
15 about amino acid residues 166-172. 206-212. 217-223. 246-252. 308-314. 312-318. 361-367 and an 
immunoglobulin and major histocompatibility complex proteins signature at amino acid residues 315-323. A 

cDNA clone containing DNA4I374 has been deposited with the ATCC on and as assigned 

ATCC deposit number . 



20 AY. Isolation of cDNA clones Encodine Human PRQ381 (UNQ322) 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
identification of the full length DNA sequence DNA44194 (Fig. 99. SEQ ID NO:250) and the derived PR0381 
native sequence protein UNQ322 (Fig. 100. SEQ ID NO:25I). 

The forward and reverse PCR primers and the hybridization probe used were the following: 
25 Forward PCR primer (3965 1 .fl ): (SEQ ID NO:252) 

5*-CTTTCCTTGCTTCACCAACATCAGGC-3' 

Reverse PCR primer (3965 l.rl ): (SEQ ID NO:253) 

S'-GCCCAGAGCAGGAGGAATGATGAGC-r 

hybridization probe (39651 .pi): (SEQ ID NO:254) 

30 5*-GTGG AACGCGGTCTTG ACTCTGTTCGTCA C I ft II 1 G ATTGGGGCTTTG-3' 

Clone DNA44194 (SEQ ID NO:250) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 174-176 and ending at the stop codon (TAG) at nucleotide 
positions 807-809 (Fig. 99), as indicated by bolded underline. The predicted PR0381 polypeptide precursor 
(Le^ UNQ322, Fig. 100, SEQ ID NO:25I) is 211 amino acids long, has a calculated molecular weight of 
35 24,172 daltons and has a pi of 5.99. The UNQ322 (SEQ ID NO:251) protein shown in Figure 100 has the 
following features: a signal peptide from about amino acid residues 1 to about 20, a potential N-glycosylation 
site at about amino acid residue 156, potential casein kinase phosphorylation sites from about amino acid 
residues 143 to about 146, about residues 156 to about 159, about residues 178 to about 181, about residues 
200 to about 203, an endoplasmic reticulum targeting sequence from about amino acid residues 78 to about 1 14 
40 and from about residues 118 to about 131, EF-hand calcium binding domain from about amino acid residues 
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140 to about IS9, and an S-100/ICaBP type calcium binding domain from about amino acid residues m to 
about 203. A cDNA clone containing DNA44I94 (SEQ ID NO.250) has been deposited with die ATCC on 
April 28. 1998 and is assigned deposit number 209808. 

AZ. Isolation of c DNA clones Encodine Murine PRQ720 (UNQ388) 

The preparation of DNA535I7 (SEQ ID NO:255) is described above under "X. Isolation of cDNA 
clones Encoding Human PRO770 (UNQ408) .- done DNA53517 (SEQ ID NO:255) contains a single open 
reading frame with as apparent translation initiation site at nucleotide residues 36-38 and ending at the stop 
codon (TAA) at 369-371 (Figure 101), as indicated by bolded underline. The predicted PRO720 polypeptide 
precursor (i.e.. UNQ388. SEQ ID NO:256) is III amino acids long (Figure 102). has a calculated molecular 
weight of 1 1.936 daltons and a pi of 5.21 . 

Analysis of the UNQ388 (SEQ ID NO:256) polypeptide of Figure 102 reveals a signal sequence at 
about amino acid residues 1-23. N-myristolation sites at about amino acids residues 70-76 and 75-81 and 
piokaryoric membrane lipoprotein lipid attachment sites at 66-77 and 68-79. A cDNA clone containing 
DNA53517 (SEQ ID NO:255) has been deposited with the ATCC on April 23. 1998 and is assigned deposit 
number 209802. 
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BA. Isolation of cDNA clones Encodine Human PRQ866 (UNQ43S1 

Use of the ECD homology procedure described above in a human fetal kidney library resulted in the 
identification of the full length DNA sequence DNA5397I (Fig. 103, SEQ ID NO:257) and the derived 
PR0866 native sequence protein UNQ435 (Fig 104. SEQ ID NO:258). 
The forward and reverse PCR primers and (he hybridization probe used were the following: 
Forwart PCR primer (44708.fl): (SEp m 

y-CAGCACrGCCAGGGGAAGAGGGO' 

Forward PCR primer (44708.£2): (SEQ ID NO:260) 

5'-CAGGACTCGCTACGTCCG-3• 

Forward PCR primer (44708.D): (SEQ , D NO;26 , , 

3'-CAOCCCCTTCTCCTC Cl 1 r C TC CC-3' 

ReversePCR P rimer(44708.rl): (SEQ ID NO:262) 

5'-GCAGTTATCAGGGACGCACTCAGCC-3' 

Revo* PCR primer (44706x2): (S£Q ^ NO;263) 

5"-CCAGCGAGAGGCAGATAG-3' 

Reverse PCRpri»er<44706,3): (SEQ ID NO:264) 

5* -CGGTCACCGTGTCCTGCGGGATG-3' 

hybridization probe (44708.pl): (SEQ ID NO:265) 

y-CAGCCCCrTCrCCT(XTTTCTCCCACGTCCTATCTGCCICTC-3' 

The done DNA53971 (SEQ ID NO:257) contains a single open reading fame with an apparent 
ovulational initiation sac at nucleotide positions 275-277 and ending at the stop codon (TAA) at nucleotide 
positions 1268-1270 (Figure 103). as indicated by bolded underline. The predicted native sequence PRO866 
polypeptide precursor (Lc. UNQ435. SEQ ID NO:258) is 331 amino acids (Figure 104), has a calculated 
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molecular weight of 35,844 daltons and a pi of 5.45. The UNQ435 (SEQ ID NO:258) protein shown in figure 
104 has an estimated molecular weight of about 35,544 daltons and a pi of about 5.45. Further analysis reveals 
a signal peptide from about amino acid residue 1 to about residue 26, glycosaminoglycan attachment sites at 
about amino acid residues 131-135, cAMP- and cGMP-dependent protein kinase phosphorylation sites at about 
5 amino acid residues 144-148 and N-myristoylauon sites at amino acid residues 26-32, 74-80, 132-138, 134- 
140, 190-196, 287-293 and 290-296. A cDNA clone containing DNA53971 (SEQ ID NO:257) has been 
deposited with the ATCC on April 6, 1 998 and is assigned deposit no. 209750. 

BB. Isolation of cDNA clones Encoding Human PRO840 (UNQ433) 

10 The use of a yeast screen procedure on tissue isolated from a human thyroid library resulted in an EST 

sequence which served as the template for the creation of PCR oligonucleotides which ultimately resulted in 
the isolation of DNA53987 (SEQ ID NO:266. Figure 105) and the derived PRO840 native sequence protein 
UNQ433 (SEQ ID NO:267, Figure 106). 

A nucleotide sequence encoding a polypeptide of substantial homology with LTNQ433 (SEQ ID 
15 NO:267) of Figure 106 is also available from GenBank as accession number HEEPSSARC_I. 

DNA53987 (SEQ ID NO:266) as shown in Figure 105 contains an open reading frame with a 
translation initiation site at about nucleotide residues 18-20 and ending at the stop codon (TGA) at nucleotide 
residues 1329-133 L as indicated by bolded underline. The second methionine codon at nucleotide residues 90- 
92 could possibly also be the actual translation initiation site - alternatively, this codes for an internal 
20 methionine. The predicted PRO840 polypeptide (i.e.. the longer translation) has been termed UNQ433 (SEQ 
ID NO:267) and is 437 amino acids long (Figure 1 06), has a calculated molecular weight of 49,851 daltons and 
a pi of 6.47. 

A cDNA clone containing DNA53987 (SEQ ID NO:266) has been deposited with the ATCC on May 
12, 1998 under ATCC deposit number 209858. 
25 Analysis of the UNQ433 polypeptide of Figure 106 (SEQ ID NO:267) reveals a signal sequence at 

about amino acid residues 1-46. a transmembrane domain at about amino acid residues 319-338. an N- 
glycosyiation site at about residues 200-204. a cAMP and cGMP-dcpendent protein kinase phosphorylation 
sites at amino acid residues 23-27, tyrosine kinase phosphorylation sites at amino acid residues 43-52 and N- 
myristolylation sites at residues 17-23. 112-118, 1 16-122 and 185-191. 

30 

BC. Isolation of cDNA clones Encoding Human PRQ982 (UNQ483) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
seq ue nc e no. 43715. This sequence was men compared to a variety of various EST databases as described 
under die signal algorithm procedure above, and further resulted in the identification of Merck EST No. 
35 AA024389. The mil- length clone corresponding to this EST resulted in the identification of the full-length 
sequence DNA57700 (Fig. 107, SEQ ID NO:268) and the derived PR0982 native sequence protein UNQ483 
(Fig. 108, SEQIDNO:269). 

The DNA57700 sequence of Figure 107 (SEQ ID N0268) contains a single open reading frame with 
an apparent translation initiation site at nucleotide positions 26-28 and ending at the stop codon (TAA) round at 
40 nucleotide positions 401-403, as indicated by bolded underline. The prediced PR0982 polypeptide precursor 
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(tt. UNQ982. SEQ ID NO:191) is 124 amino acids in length, has a calculated molecular weignt ot 
approximately 14,198 daltons and an estimated pi of approximately 9.01 (Fig. 108). Further analysis of the 
UNQ483 (SEQ ID NO:269) polypeptide of Figure 108 reveals a signal peptide from about amino acid residues 
1 to about 21 and potential anaphylatoxin domain from about amino acid residue I to about residue 59. a 
cDNA clone containing DNA57700 (SEQ ID NO:268) was deposited with the ATCC on January 12, 1999 and 
is assigned ATCC deposit No. 203583. 

BD. Isolation of c DNA clones Encoding Human PRQ836 < UNQS4S) 

Use of the signal algorithm procedure described above resulted in the identification of EST clusters 
which were then compared to a variety of various EST databases as described under the signal algorithm 
procedure above, and further resulted in ihe identification of lncyte EST 26 1 0075, an EST derived from colon 
tumor tissue. The full-length clone corresponding to this EST resulted in the identification of the full-length 
sequence DNA59620 (Fig. 109, SEQ ID NO:270) and the derived PR0836 native sequence protein UNQ545 
(Fig. 110.SEQIDNO.271). 

The nucleotide sequence DNA59620 (SEQ ID NO:270) shown in Figure 109 contains a single open 
reading frame with an apparent translation initiation site at nucleotide positions 65-67 and ending at the stop 
codon (TCA) at nucleotide positions I44R-I450 (Fig. 109). as indicated by bolded underline. The predicted 
PR0836 polypeptide precursor (i.e.. UNQ545. Fig. 110. SEQ ID NO:271) is 461 amino acids in length 
UNQ545 (SEQ ID NO:27I) shown in Figure 1 10 has an estimated molecular weight of about 52.085 daltons 
and a pi of about 5.36. Further analysis reveals a signal peptide at about amino acid residues I to about 29. N- 
glycosylation sites at about amino acid residues 193 and 236 and N-myristoylauon sites at about residues 15. 
19. 234. 251. 402 and 451. a domain conserved in the YJLl26w/YLR35lc/yhcX family of proteins at about 
ammo acid residues 364 to about 372. and a region having sequence identity with SLS1 protein at about amino 
acid residues 68 to about 340. 

A cDNA clone containing DNA59620 (SEQ ID NO:270) has been deposited with the ATCC on 16 June 1998 
and is assigned deposit number 209989. 

BE - Isolation of cDNA clones Encoding Human PROI 159 (UNQ589) 

Use of the signal algorithm procedure described above resulted in the identification of EST cluster 
sequence 77245, which was then compared to a variety of various EST databases as described under the signal 
algorithm procedure above, and further resulted in the identification of lncyte EST no. 376776. Analysis of the 
full-length clone corresponding to this EST resulted in the identification of the full-length sequence 
DNA60627(Fig. 111. SEQ ID HOJ.72) and the derived PROI 159 native sequence protein UNQ589 (Fig. 112, 
SEQIDNO:273). 

Clone DNA60627 (SEQ ID NO:272) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 92-94 and ending at the stop codon (TAG) at nucleotide 
positions 362-364 (Figure 111), as indicated by bolded underline. The predicted PROI 159 polypeptide 
precursor (£e.. UNQ589. SEQ ID NO:273) is 90 ammo acids long (Figure 112). The UNQ589 {SEQ ID 
NO:273) protein shown m Figure 1 12 has an estimated molecular weight of about 9.840 daltons and a pi of 
about 10.13. 
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AnaJysis of the UNQ589 (SEQ ID NO:273) sequence shown in Figure 1 12 evidences the presence or 
the following: a signal peptide from about amino acid residue 1 to about residue 15 and a potential N- 
giycosylation site at about amino acid residue 38. Clone DNA60627 (SEQ ID NO:272) has been deposited 
with ATCC on August 4, 1998 and is assigned ATCC deposit no. 203092. 

5 

BF. Isolation of cDNA clones Encoding Human PRQ1358 (UNQ707) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence, which was then compared to a variety of various EST databases as described under the signal 
algorithm procedure above, and further resulted in the identification of Incyte EST 088718. a fragment derived 

10 from a liver tissue library. Analysis of the full-length clone corresponding to the EST resulted in the 
identification of the full-length sequence DNA64890 (Fig. 1 13. SEQ ID NO:274) and the derived PR01358 
native sequence protein UNQ707 (Fig. 1 14. SEQ ID NO:275). 

The DNA64890 (SEQ ID NO:274) clone shown in Figure 1 13 contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 86 through 88 and ending at the stop codon 

15 (TAA) found at nucleotide posiuons 1418 through 1420 (Figure 1 13). as indicated by bolded underline. The 
predicted PR01358 polypeptide precursor (Lc. UNQ707, SEQ ID NO:275) is 444 amino acids long, and a 
signal peptide is at about amino acid residues 1-18. UNQ707 (SEQ ID NO:275) has a calculated molecular 
weight of approximately 50719 daltons and an estimated pi of approximately 8.82. A cDNA clone containing 
DNA64890 (SEQ ID NO:274). designated as DNA64890-I612, was deposited with the ATCC on August 18. 

20 1998 and is assigned ATCC deposit no. 203 13 1. 

BG. Isolation of cDNA clones Encodinc Human PRQ1325 (UNQ685) 

Use of the signal algorithm procedure described above resulted in the identification of the EST cluster 
sequence no. 139524, which was then compared lo a variety of various EST databases as described under the 
25 signal algorithm procedure above, and further resulted in the identification of Incyte EST 3744079. Analysis 
of the full-length clone corresponding to the EST resulted in the identification of the full-length sequence 
DNA66659 (Fig. 1 15. SEQ ID NO:276) and the derived PROI325 native sequence proiein UNQ685 (Fig. 116. 
SEQ ID NO:277). 

Clone DNA66659 (Fig. 1 15. SEQ ID NO:276) contains a single open reading frame with an apparent 
30 translation initiation site at nucleotide positions 51-53 and ending at the stop codon (TAG) at nucleotide 
positions 2547-2549, as indicated by bolded underline. The predicted PRO 1325 polypeptide precursor (i.e^ 
UNQ685, SEQ ID is 832 amino acids long. The UNQ685 (SEQ ID NO:227) protein shown in Figure 

1 16 has an estimated molecular weight of about 94,454 daltons and a pi of about 6.94. Further analysts of 
UNQ685 (SEQ IDN0.227) reveals: a signal peptide from about amino acid 1 to about amino acid 18, 
35 transmembrane domains from about amino acid 292 to about amino acid 317, from about amino acid 451 to 
about amino acid 470, from about amino acid 501 to about amino acid 520, from about amino acid 607 to 
about amino acid 627 from about amino acid 751 to about amino acid 770, a leucine zip pe r pattern sequence 
from about amino acid 497 to about amino acid 518 and potential N-glycosyiation sites from about amino acid 
27 to about amino acid 30, from about amino acid 54 to about amino acid 57, from about amino acid 60 to 
40 about amino acid 63, from about amino acid position 123 to about amino acid position 126, from about amino 
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acid portion 141 to about ammo acid position 144. from about amino acid position 165 to about amino acid 
posmon 168, from about amino acid position 364 to about amino acid position 367. from about amino acid 
position 476 to about amino acid position 479. from about amino acid position 496 to about amino acid 
position 499, from about amino acid position 572 to about amino acid position 575, from about amino acid 
petition 603 to about amino acid position 606 and from about amino acid position 699 to about amino acid 
I>osition 702. A cDNA clone containing DNA66659 (SEQ ID NO:276)> has been deposited with ATCC on 
September 22, 1 998 and is assigned ATCC deposit no. 203269. 

BH. Isolation of cDNA clones Encoding Human PRQ1338 (UNQ693) 

The use of yeast screens resulted in EST sequences which were then compared to various public and 
private EST databases in a manner similar to that described above under ECD homology resulted in the 
identification of Incytc EST26I5184, an EST derived from cholecystitis gall bladder tissue. Analysis of the 
corresponding full-length sequence ultimately resulted in the isolation of DNA66667 (SEQ ID NO:278, Figure 
1 17) and the derived PRO 1338 native sequence protein UNQ693 (SEQ ID NO:279. Figure 118). 

DNA66667 (SEQ ID NO:278) as shown in Figure 1 17 contains a single open reading frame with a 
translation initiation site at about nucleotide residues 115- II 7 and ending at the stop codon (TAA) at 
nucleotide positions 2263-2265. as indicated by bolded underline. The predicted PROI338 polypeptide 
precursor UNQ693. SEQ ID NO:ll8) is 716 amino acids in length (Figure 118), has a calculated 
molecular weight of 80.716 daltons and a pi of 6.06. 

Analysis of the UNQ693 polypeptide (SEQ ID NO:278) of Figure 118 reveals a signal sequence at 
about amino acid residues I to 25. a transmembrane domain at about amino acid residues 629-648, N- 
grycosylation sites at about amino acid residues 69-73, 96-100, 106-1 10. 1 17-12L 385-389, 517-521. 582-586 
and 61 1-615, a tyrosine kinase phosphorylation site at about residues 573-582 and N-myristoylation sites at 
about amino acid residues 16-22. 224-230. 464-470, 637-643 and 698-704. 

A cDNA containing DNA66667 (SEQ ID NO:278) has been deposited with the ATCC under the 
designation DNA66667-1 596 on September 22. 1 998 and has been assigned ATCC deposit number 203267. 

BI. Isolation of cDNA clones Encoding Human PRO 1434 (UNQ739) 

Use of ECD homology procedure described above in a human retina tissue library resulted in the 
identification of the full-length DNA sequence DNA688I8 (Fig. 119, SEQ ID NO:280) and the derived 
PR01434 native sequence protein UNQ739 (Fig. 120, SEQ ID NO:281). 

The PCR primers (forward and reverse) and hybridization probe synthesized in this procedure were 
the following: 
forward PCR pitmen 

5»-G AGGTGTCGCTGTGAAGCCAACGG-3' (SEQ m NO: 282) 

revei&e PCR primer 

y-CGCTCGATTCTCCATCTGCCTTCC-3' (SEQ ID NO:283) 

hybridization probe: (SEQ ID NO:284) 

y-GACGGAGTGTGTGGACCCTGTGTACGACCCTGATCAOTGCTGTCCO' 
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Clone DNA68818 (SEQ ID NO:280) contains a single open reading frame with an apparent 
translation initiation site at nucleotide positions 581-583 and ending at the stop codon (TAG) at nucleotide 
positions 1556-1558 (Figure 119), as indicated by bolded underline The predicted PRO 1434 polypeptide 
precursor (Le., UNQ739, SEQ ED NO:28l) is 325 amino acids bng (Figure 120). The UNQ739 (SEQ ID 
5 NO:28 1 ) protein shown in Figure 120 has an estimated molecular weight of about 35,296 daltons and a pi of 
about 5 J 7. Further analysis reveals a signal sequence at about amino acid residues 1-27, a glycosaminoglycan 
attachment site at about amino acid residues 80-84, M-myristoylation sites at about amino acid residues 10-16, 
102-108, 103-109, a cell attachment sequence at about amino acid residues 1 14-1 17 and an EGF-like domain 
cysteine pattern signature at about amino acid residues 176-188. 
10 A clone containing DNA68818 (SEQ ID NO:280) has been deposited with ATCC under the 

designation DNA688 18-2536 on February 9, 1999 and is assigned ATCC deposit no. 203657. 

BJ. Isolation of cDNA clones Encoding Human PRQ4333 (UNO! 888) 

An expressed sequence tag (EST) DNA database (UFESEQ* Lncytc Pharmaceuticals, Palo Alto, CA) 
15 was searched in a manner similar to that described above under the ECD homology procedure described above 
and an EST was identified which showed homology to lymphotoxin-bcta receptor. 

The EST served as the template to create oligonucleotide primers and probes to screen a human fetal 
kidney library in a manner similar to that described above under the ECD homology procedure. 
The oligonucleotides created for the above procedure were the following: 
20 forward PGR primer {SE q id NO:287) 

5MjCAAGAATrCAGGGATCGGTCTGG-3' 

P* 00 * (SEQ ID NO:288) 

5* -CTGTGTTCCCTGCAACCAGTGTGGGCCAGGCATGG AGTTGTCTAAGG-3* 

(SEQ ID NO:289) 

25 5'-AGATGGCATCACTG GTGGCTGAAC-3* 

forward: (SEQ ID NO:290) 

5'-CAGAAGGCAAATTGTTCAGCCACCAG-3 , 

rcvctsc; (SEQIDNO:291) 
5'-ACAGTrTCCAGACCGATCCCTGAATTC-r 

30 The result was the isolation of the full-length DNA sequence DNA84210 (SEQ ID NO:285, Figure 

121). The DNA84210 (SEQ ID NO:285) clone depicted in Figure 121 contains a single open reading frame 
with an apparent translation initiation site at nucleotide positions 185-187, and a stop codon (TAA) at 
nucleotide positions 1436-1438. as indicated by bolded underline. The predicted PR04333 polypeptide 
precursor (U. t UNQ1888, SEQ ID NO:286) is 417 amino acids long. The UNQ1888 protein (SEQ ID 

35 N0086) shown in Figure 121 has an estimated molecular weight of about 45305 daltons and a pi of about 
5.12. 

Analysis of the UNQ1888 polypeptide (SEQ ID NO:286) of Figure 121 reveals a signal peptide at 
about amino acid residues 1-25, a transmembrane domain at about residues 169-192, N-giycosylation sites 
about residues 105-109, 214-218, 319-323, 350-354, 368-372, 379-383, cAMP- and cGMP-dependent protein 
40 kinase phosphorylation sites at about residues 200-204 and 238-242, a tyrosine kinase phosphorylation site at 
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about residues 207-214. an N-nryrismylation site at about residues 55-61. 215-218 and 270-276, a prokaiyotic 
membrane lipoprotein lipid attachment site at about residues 259-270 and a TNFR/NGFR family cysteme-rich 
region at about residues 89-96. 

A cDNA clone contaming DNA84210 (SEQ ID NO:285) f designated as DNA842 10-2576, has been 
deposited with ATCC on March 2, 1 999 and is assigned ATCC deposit no. 2038 1 8. 

BK. Isolation of cDNA clones Encoding Human PRO4302 flJNQl 866) 

Use of the amylase screen procedure described above on tissue isolated from human tissue resulted in 
an EST sequence which was then compared against various EST databases to create a consensus sequence by a 
methodology as described above under the amylase yeast screen procedure and/or the ECD homology 
procedure. Further analysis of this consensus sequence resulted in the identification of Incyte EST no. 
2408081H1. Analysis of the full-length clones corresponding to EST no. 240808IH1 resulted in the isolation 
of me full length native sequence clones DNA92218 (SEQ ID NO:292) and the derived PRO4302 fiiU-length 
native sequence protein UNQ 1866 (SEQ ID NO:293). 

The mil length clone DNA92218 (SEQ ID NO:292) shown in Figure 123 has a single open reading 
frame wuh an apparent translation^ initiation site at nucleotide positions 174-176 and a stop signal (TAG) at 
nucleotide positions 768-770. as indicated by bolded underline. The predicted PRO4302 polypeptide precursor 
(Le.. UNQ1866. SEQ ID NO:293) is 198 amino acids long, has a calculated molecular weight of approximately 
22085 daltons and an estimated pi of approximately 9.35. Analysis of UNQ 1866 (Fig. 124, SEQ ID NO:293) 
reveals a signal peptide from about amino acid residue I to about residue 23. a transmembrane domain from 
about amino acid residue III to about residue 130, a cAMP and cGMP-dependent protein kinase 
phosphorylation sites at residues 26-30. casein kinase II phosphorylation sites at residues 44-47 and 58-61, a 
tyrosine kinase phosphorylation site at residues 36-43 and N-myristoylation sites at residues 124-130. 144-150 
and 189-195. 

A cDNA clone containing DNA92218 (SEQ ID NO:292), designated DNA922 18-2554, was 
deposited with the ATCC on March 9. 1999 and has been assigned deposit number 203834. 

BL. Isolation of cDNA clone s Encoding Human PRO4430 (UNO 1947) 

Use of the signal algorithm procedure described above resulted in the identification of an EST cluster 
sequence, which was then compared to a variety of various EST databases as described under the signal 
algorithm procedure above, and further resulted in the identification of a consensus sequence. Further analysis 
of the consensus sequence resulted in the identification of the full-length sequence DNA96878 (Fig. 125, SEQ 
ID NO:294) and the derived PRO4430 native sequence protein UNQ1 947 (Fig. 126, SEQ ID H009S). 

The native sequence DNA sequence DNA96878 (SEQ ID NO:294) shown in Figure 125 contains a 
single open reading frame with an apparent translation initiation site at nucleotide positions 56-58 and ending 
at the stop codon (TGA) found at nucleotide positions 43M33, as indicated by bolded uridcrline. The 
predicted PRO4430 polypeptide precursor (UNQ1947, Fig. 126, SEQ ID N0295) is 125 amino acids long. 
The UNQ4430 protein (SEQ ID N0295) of Figure 126 has a calculated molecular weight of ar^tmately 
13821 daltons and an estimated pi of approximately 8.6. Further analysis reveals the presence of a signal 
sequence at about amino acid residues 1 to about 18, N-glycosylation sites at about residues 77-80 and again at 
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about residues 88-91, a casein kinase H phosphorylation site at about residues 67-70, an N-rayristoylation site 
at about residues 84-89 and a Lys-6/u-F AR domain at about residues 85-98. 

A clone containing DNA96878 (SEQ ID NO:294), designated DNA96878-2626, was deposited with 
the ATCC on May 4, 1999 and is assigned ATCC deposit no. 23-PTA. 

BM. Isolation of cDNA clones Encoding Human PRQ5727 (UN02448) 

Various known TNF-receptors were used to screen public and private EST databases (e.g., see BCD 
homology procedure, above) resulting in the identification Incyte clone 509151 1H. This EST sequence, which 
was derived from uterine tumor tissue, then served as a template for the construction of the cloning oligos 
indicated below which were then used to identify by PGR a human thymus cDNA library that contained the 
sequence of interest. These oligonucleotides were: 
Forward primer (509-1): 

5'-G AGGGGGCTGGGTG AG ATGTG-3 * (SEQ ID NO:298) ( 

Reverse primer ( 509-4 AS): 

5*-TGLi j i t GTACCTGCG AGGAGG-3* (SEQ ID NO:299) 

To isolate the DNA sequence encoding the full-length DNA98853 polypeptide, an inverse long 

distance PCR procedure was carried out (Figure 129). The PCR primers generally ranged from 20 to 30 

nucleotides. For inverse long distance PCR, primer pairs were designed in such a way that the 5* to 3' direction 

of each primer pointed away from each other. 

A pair of inverse long distance PCR primers for cloning DNA98853 were synthesized: 

Primer 1 (left primer) (509-P5): 

5-pCATGGTGGGAAGGCCGGTAACG-3' (SEQ ID NO300) 

Primer 2 (right primer) (509-P6): 

5 , -pGATTGCCAAGAAAATGAGTACTGGGACC-3* (SEQ ID NO:301 ) 

In the inverse long distance PCR reaction, the template is the plasmid cDNA library. As a result, the 

PCR products contain the entire vector sequence in the middle with insert sequences of interest at both ends. 

After the PCR reaction, the PCR mixture was treated with Dpn I which digests only the template plasmids. 

followed by agarose gel purification of PCR products of larger than the size of the library cloning vector. 

Since the primers used in the inverse long distance PCR were also 5'-phosphorylated. the purified products 

were then self-ligated and transformed into E.coli competent ceils. Colonies were screened by PCR using 5* 

vector primer and proper gene specific primer to identify clones with larger 5' sequence Plasmids prepared 

from positive clones were sequenced. If necessary, the process could be repeated to obtain more 5* sequences 

based on new sequence obtained from the previous round. 

The purpose of inverse long distance PCR is to obtain the complete sequence of the gene of interest. 

The clone containing the full length coding region was then obtained by conventional PCR. 

Tne primer pair used to clone the full length coding region of DNA98853 (SEQ ID N0296) were the 

following: 

Forward primer (Cla-MD-509): 

$ -GGAGGATCG ATACCATGGATTGCCAAGAAAATG AG-3* (SEQ ID NO:302) 

Reverse primer (509.TAA.not): 
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y^GGAGCGGCCGCTrAAGGGCTGGGAACrrCAAAGGGCAC^ (SEQ ID NO303) 

For cloning purposes, a Cla I site and a Not I site were included in the forward primer and reverse 
primer respectively. 

To ensure the accuracy of the PCR products, independent PCR reactions were jwformed and several 
cloned products were sequenced. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
DNA98853 (SEQ ID NO*!*, Figure 127) and the derived PROS727 native sequence protein UNQ2448 (SEQ 
ID NO:297, Figure 128). 

Clone DNA98853 (SEQ ID NO:296) contains a single open reading fnune with an apparent 
translation initiation site at nucleotide positions 1-3 and ending at the stop codon (TAA) at nucleotide positions 
901-903 (Figure 127), as indicated by bolded underline. The predicted PR05727 polypeptide precursor (Le. 
UNQ2448, SEQ ID NO:297) is 299 amino acids long (Figure 128). has a calculated molecular weigh, of 
32.929 daltons and a pi of 4.95. The UNQ2448 polypeptide (SEQ ID NO:297) shown in Figure 128 has an 
estimated molecular weigh, of about 3 J kilodaltons and a pi of about 4.72. A potential N-glycosyiation site 
exists between amino acids 74 and 77 of the ammo acid sequence shown in Figure 128. A potential N- 
myristoylation site exists between amino acids 24 and 29 of the ammo acid sequence shown in Figure 128. 
Potential casein kinase 11 phosphorylation sites exist between amino acids 123-126. 185-188. 200-203 252- 
255. 257-260, 271-274. and 283-286 of the amino acid sequence shown in Figure 128. A potential 
uansmembrane domain exists between amino acids 137 to 158 of the sequence shown in Figure 128. It is 
presently believed that the polypeptide does not include a signal sequence. 

A cDNA clone containing DNA98853 (SEQ ID NO:296, designated DNA98853-I739. has been 
deposited with ATCC on April 6. 1 999 and is assigned ATTC Deposit No. April 6. 1999. 

EXAMPLE 2 

Stimulatory Activity in Mixed I vmnhoevte R«^i on (MLR) Assay (no.24) 
This example shows that the polypeptides of the invention are active as a stimulator of the 
proliferation of stimulated T-lym P hocytes. Compounds which stimulate proliferation of lymphocytes are 
useful therapeutically where enhancement of an immune response is beneficial. A therapeutic agent may take 
the form of antagonists of the polypeptide of die invention, for example, murine-human chimeric, humanized 
or human antibodies against the polypeptide. 

The basic protocol for this assay is described in Current Protocols in Immunology, unit 3.12; edited 
by J. E. Coligan, A. M. Kruisbeek. D. H. Marglies, E M. Sbevach. W. Strober. National Institutes of Health. 
Published by John Wiley & Sons, fate 

More specifically, in one assay variant peripheral blood mononuclear cells (PBMC) are isolated from 
mammalian individuals, for example a human volunteer, by leukopheresis (one donor will supply stimulator 
PBMCs. the other donor will supply respond* PBMCs). If desired, the cells are frozen in fetal bovine serum 
and DMSO after isolation. Frozen cells may be thawed overnight in assay media (37°C, 5% C0 2 )and then 
washed and resuspended to 3 x 10« cells/ml of assay media (RPMI; 10% fetal bovine serum. 1% 
pemcfll^snepmmycm, IV.glutamine, IV.HEPES, 1% non-essential amino acids, 1% pyruvate). 
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The stimulator PBMCs are prepared by irradiating the cells (about 3000 Rads). The assay is prepared 
by plating in triplicate wells a mixture of: lOOul of test sample diluted to 1% or to 0.1%; 50 ul of irradiated 
stimulator cells and 50 ul of responder PBMC cells. 1 00 microliters of cell culture media or 100 microliter of 
CD4-IgG is used as the control. The wells are men incubated at 37°C. 5% CO2 for 4 days. On day 5 and each 
5 well is pulsed with tritiated thymidine (1.0 mC/well: Amersham). After 6 hours the cells are washed 3 times 
and then the uptake of the label is evaluated. 

In another variant of this assay, PBMCs are isolated from the spleens of Balb/c mice and C57B6 mice. 
The cells are teased from freshly harvested spleens in assay media (RPM1; 1 0% fetal bovine serum, 1% 
penicillin/streptomycin, 1% glutamine. 1% HEPES, 1% non-essential amino acids. 1% pyruvate) and the 

10 PBMCs are isolated by overlaying these cells over Lympholyte M (Organon Tcknika), cemrimging at 2000 
rpm for 20 minutes, collecting and washing the mononuclear cell layer in assay media and resuspending the 
cells to lx 10 7 cells/ml of assay media. The assay is then conducted as described above. The results of this 
assay for compounds of the invention are shown below. Positive increases over control arc considered positive 
with increases of greater than or equal to 1 80% being preferred. However, any value greater than control 

1 5 indicates a stimulatory effect for the test protein. 
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Table 7 




PRO 


PRO Concentration 


Percent Increase Over C 


PR0356 


0.1% 


133.8 


PR0356 


0.1% 


208.9 


PR0356 


1.0% 


25 L6 


PR0356 


1.0% 


332.1 


PR0273 


12.4 nM 


112 


PR0273 


124 nM 


192.7 


PR0769 


23.86 nM 


76.3 


PR0769 


238.6 nM 


226 


PR01184 


16.88 nM 


81.6 


PROH84 


168.82 nM 


194.4 


PR01346 


3.34 nM 


86.6 


PR01346 


33.41 nM 


1883 


PR01246 


0.07 nM 


145 


PR01246 


0.7 nM 


180.9 


PR0269 


0.1% 


122.4 


PR0269 


1% 


194.1 


PR0344 


0.1% 


148.6 


PR0344 


1% 


259.9 


PR0333 


0.1% 


187.8 


PR0333 


1% 


220 


PR0381 


143 nM 


87J 


PR0381 


143 nM 


135.4 
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PR0381 


145 nM 


248.1 


PR0381 


145 nM 


290.8 


PR0533 


0.06 nM 


163 


PR0533 


0.61 dM 


3824 


PRO720 


0.1 nM 


198.4 


PRO720 


1.0 nM 


293.5 


PRO866 


0.1 nM 


131.8 


PRO866 


1.04 nM 


223.2 



EXAMPLE 3 
Hairless Guinea pip Proinflammatory Assay (no. 32) 

This assay is designed to determine whether the PRO polypeptides show the ability to induce vascular 
permeability. Polypeptides testing positive in this assay are expected to be useful for the therapeutic treatment 
of conditions which would benefit from enhanced vascular permeability including, for example, conditions 
which may benefit from enhanced local immune system cell infiltration. 

Hairless guinea pigs weighing 350 grams or more were anesthetized with Ketammc (75-80 mg/kg) 
and 5 mg/kg Xylazine intramuscularly. Test samples containing the PRO polypeptide or a physiological buffer 
without me test polypeptide are injected into skin on the back of the test animals with 100 ul per injection site 
mtradermally. There were approximately 16-24 injection sites per animal. One ml of Evans blue dye (IV. in 
PBS) is then injected mtracardiaily. Skin vascular permeability responses to the compounds (/.c. blemishes at 
the injection sites of injection) are visually scored by measuring the diameter (in mm) of blue-colored leaks 
from the site of injection at 1 . 6 and/or 24 hours post admimstration of the test materials. The mm diameter of 
blueness at the site of injection is observed and recorded as well as the severity of the vascular leakage for 
values scoring above 4 standard deviations over the same animal control. Blemishes of at least 5 mm in 
diameter are considered positive for the assay when testing purified proteins, being indicative of the ability to 
induce vascular leakage or permeability. A response greater than 7 mm diameter is considered positive for 
amdinoned media samples. Human VEGF is used as a positive control inducing a response of 4-8 mm 
diameter at 0.1 ug/100 uL, and 15-23 mm diarn. at I ug/100 ul. 

The tested polypeptide are diluted to 1% of the initial stock solution. UNQ 585 was diluted into 10 
mM HEPES/140 mM NaCl/4% mannitol/1 mg/ml BSA pH 6.8, while UNQ334 was diluted into 140 mM 
NaCl, 10 mM Hcpes, 4% Mannitol pH 7.4. 

Table 8 

UNQ polypeptide Stock solution conr^mm™ Time (hp dialation fmm) 

PROI155 20^84nM | * 6 

PROn55 20.384 nM 6 6 

PKOSH 1024 nM 1 5.4 

W^ 0533 1024 nM 6 7 

P* 021 22,000 nM | 2.0 

P* 021 22,000 nM 6 14.0 ~ 
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EXAMPLE 4 
Skin Vascular Permeability Assay fno.64) 
This assay shows that certain PRO polypeptides stimulate an immune response and induce 
inflammation by inducing mononuclear cell, eosinophil and PMN infiltration at the site of injection of the 
5 animal. This skin vascular permeability assay is conducted as follows. Hairless guinea pigs weighing 350 
grams or more are anesthetized with kecamine (75-S0 rag/Kg) and 5 rag/Kg Xylazine intramuscularly (IM). A 
sample of purified PRO polypeptide or a conditioned media test sample is injected intradermal^ onto the backs 
of the test animals with 100 uL per injection site. It is possible to have about 10-30, preferably about 16-24, 
injection sites per animal One mL of Evans blue dye (1% in physiologic buffered saline) is injected 
10 irtnacardially. Blemishes at the injection sites are then measured (mm diameter) at Ihr, 6 hrs and 24 hrs post 
injection. Animals were sacrificed at 6 hrs after injection. Each skin injection site is biopsied and fixed in 
paraformaldehyde. The skins are then prepared for histopathologic evaluation. Each site is evaluated for 
inflammatory cell infiltration into the skin. Sites with visible inflammatory cell inflammation are scored as 
positive. Inflammatory cells may be neutrophilic eosinophilic, monocytic or lymphocytic 
15 At least a minimal perivascular infiltrate at the injection site is scored as positive, no infiltrate at the 

site of injection is scored as negative. 



Table 9 



UNO 


Time (hrs) 


Infiltrate 


PR0172 


24 


positive 


PRO200 


24 


positive 


PRO200 


24 


positive 


PR0216 


24 


positive 


PR0272 


24 


positive 


PR0362 


24 


positive 


PRO1007 


24 


positive 


PRO 103 1 


24 


positive 


PROI283 


24 


positive 


PR01343 


24 


positive 


PR01358 


6 


positive 


PR01325 


6 


positive 


PR01434 


24 


positive 


PR04333 


6 


positive 




EXAMPLES 






Inhmitorv Activity in Mixed Lvmohocvte Reaction (MLR) Assay (no. 


67) 



This example shows that one or more of the PRO polypeptides are active as inhibitors of the 
proliferation of stimulated T-rymphocytes. Compounds which inhibit proliferation of lymphocytes are useful 
therapeutically where suppression of an itwmim#% response is beneficial. 
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10 



15 



20 



25 



The basic protocol for this assay is described in Current Protocols in Immunology, unit 3.12; edited 
by J. E. Coligan, A. M. Kniisbeek, D. H. Marglies, £. M. Shevach. W. Strober. National Institutes of Health. 
Published by John Wiley & Sons, Inc. 

More specifically, in one assay variant, peripheral blood mononuclear cells (PBMC) are isolated from 
mammalian individuals, for example a human volunteer, by leulcopberesis (one donor will supply stimulator 
PBMCs. the other donor will supply responder PBMCs). If desired, the cells are frozen in fetal bovine serum 
and DMSO after isolation. Frozen cells may be thawed overnight in assay media (37°C, 5% COj) and then 
washed and resuspended to 3x10* cells/ml of assay media (RPMI; 10% fetal bovine serum, 1% 
penicillin/streptomycin. 1% ghttamine. 1% HEPES. 1% non-essential amino acids. 1% pyruvate). The 
stimulator PBMCs are prepared by irradiating the cells (about 3000 Rads). 

The assay is prepared by plating in triplicate wells a mixture of: 

1 00:1 of test sample diluted to I % or to 0. 1 %, 

50:1 of irradiated stimulator cells, and 

50 :1 of respond er PBMC cells. 
100 microliters of cell culture media or 100 microliter of CD4-lgG is used as the control. The wells are then 
incubated at 37T. 5% CO, for 4 days. On day 5. each well i, pulsed with tritiated thymidine (1.0 mC/well: 
Amersham). After 6 hours the cells are washed 3 times and then the uptake of the label is evaluated. 

In another variant of this assay, PBMCs are isolated from the spleens of Balb/c mice and C57B6 mice. 
The cells are teased from freshly harvested spleens in assay media (RPMI; 10% fetal bovine serum, 1% 
pemcillm/streptomycin. 1% glutamine, 1% HEPES. 1% nonessential amino acids. 1% pyruvate) and the 
PBMCs are isolated by overlaying these cells over Lympholyte M (Organon Teknika), centrifogmg at 2000 
tpm for 20 minutes, collecting and washing the mononuclear cell layer in assay media and resuspendmg the 
cells to 1x10' cells/ml of assay media. The assay is then conducted as described above. 

Any decreases below control is considered to be a positive result for an inhibitory compound, with 
decreases of less than or equal to 80% being preferred However, any value less than control indicates an 
inhibitory effect for the test protein. 



Table 10 

£52 PRO Concentration 



30 PRO204 
PRO204 



PR0212 0.59 nM 

PR0212 5.9 nM 

P 110212 0.87 nM 

35 PR02 » 2 8.7 nM 

PR 0212 1.9 nM 



Percent Decrease Below Control 
0.1% S6 
1.0% 35 

0 



PR0212 



PR0212 0.46 nM 



52.6 
82.7 
66 
81.6 

19 nM 61J 



PR0212 



40 PR0212 ^ ^ 



66.1 

4.6 nM 595 



0 
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PRfftl 2 


4.1 mvi 


i ix •> 




21 hm 


U 




21 nM 


62*2 




u.13 nM 


74.3 


DDfv>i £ 


13 nM 


633 


rRU22o 


0.12 nM 


67.9 


PKU226 


1.2 nM 


40.6 


PRU235 


0 nM 


83.6 


PROZ35 


0.02 nM 


69.7 


PRO240 


5.3 nM 


68.2 


rKQ24U 


53 nM 


68.2 


DO Af\ 

PRQ24</ 


35 nM 


72.2 


PR024U 


350 nM 


64 


rKU24.} 


19.1 nM 


53 


DDA)i< 

rKU24.> 


191 nM 


54 


rKU24j 


0.93 nM 


71.8 


rKCJ24j 


0.93 nM 


80.9 


rKU24.> 


9.3 nM 


49.6 


dd rv>A < 


9.3 nM 


51.9 


PRU273 


31.46 nM 


81 


rKU27J 


314.56 nM 


67 


P©f\TJ*> 


0.35 nM 


74.2 


PR0332 


3.5 nM 


68 


PRU332 


035 nM 


20.2 


rKUJJZ 


33 nM 


61.2 


DDH1XI 

rKUJO 1 


1.5 nM 


63.2 


PD 


15 nM 


64.7 




8.6 nM 


76.9 




86 nM 


63.6 


PR0363 


o.o nM 


64.4 




5o nM 


2.1 


PR0364 


U.31 nM 


68.1 


rKU304 


3.1 nM 


67.4 


PR0364 


i.7nM 


92.8 


PR0364 


17 nM 


68.4 


PR0364 


1.7 hM 


942 


PR0364 


17 nM 


633 


PR0526 


0.12 nM 


68.5 


PR0526 


UnM 


623 


PR0531 


0 .2 nM 


66.1 
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PR0531 


2nM 


54J 


PR0531 


02 nM 


70.4 


PR0531 


2nM 


68.4 


PRO701 


0.74 nM 


72J 


PRO701 


0.74 nM 


90.2 


PRO701 


7.4 oM 


64.8 


PRO701 


7.4 nM 


69 


PRO770 


0.69 nM 


65.8 


PRO770 


6.9 nM 


67.4 


PR0788 


12.96 nM 


88.4 


PR0788 


129.6 nM 


57.7 


PR0788 


2.9 nM 


64.4 


PR0788 


29 nM 


67.4 


PR0865 


0.27 nM 


67.9 


PR0865 


2.7 nM 


63.7 


PRO 1 083 


7.1 nM 


80.5 


PRO1083 


71 nM 


63.7 


PRO 1083 


7.1 nM 


40.9 


PRO 1 083 


71 nM 


65 


PROU14 


0.37 nM 


44.9 


PROl 1 14 


3.7 nM 


42.4 


PROH92 


12.1 nM 


31.6 


PR01192 


121 nM 


32.6 


PROl 195 


0.5 nM 


67 


PROl 195 


5nM 


66.8 


PROl 250 


0.05 nM 


75.4 


PRO1250 


0.5 nM 


57.2 


PROI250 


0.05 nM 


94.6 


PRO1250 


0.5 nM 


61.2 


PROI312 


8JnM 


52 


PR01312 


85 nM 


493 


PR01312 


14.2 nM 


73.1 


PR01312 


142 nM 


62.9 


PR01287 


0.8 nM 


79.1 


PR01387 


8nM 


52J 


PRO1410 


4nM 


89 


PRO1410 


40 nM 


64.8 


PR014I8 


6.4 nM 


67.7 


PR014I8 


6.4 nM 


81.1 


PR01418 


64 nM 


56J 
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PRU141S 


64 nM 


64.9 




PR01868 


39.4 nM 


65.5 




PRO 1868 


394 oM 


50 




PR01917 


2.1 nM 


70.7 


5 


PR01917 


2.1 nM 


8X5 




PR01917 


21 nM 


60.7 




PR01917 


21 nM 


62.6 




PRO205 


0.7 nM 


71.5 




PRO205 


7nM 


3.5 


10 


PRO840 


24.4 nM 


137.2 




PR0982 


244 nM 


58.9 




PR0836 


2.5 nM 


60.7 




PR0836 


25 nM 


60.6 




PRO 1 159 


11.06 nM 


80.4 


15 


PRO 1 159 


1 10.55 nM 


57.6 




PRO 1159 


ll.06nM 


81.9 




PRO 11 59 


1 10.55 nM 


46.2 




PRO 1338 


0.14 nM 


80.7 




PRO 1338 


1.4 nM 


65.5 


20 


PRO4302 


1336 nM 


115.8 




PRO4302 


1 35.57 nM 


2.4 




PRO4430 


24.2 nM 


55.9 




PRO4430 


242 nM 


49.9 




PR05727 


19.6 nM 


69.2 


25 


PR05727 


196 nM 


54.5 



EXAMPLE 6 
In situ Hvbridi2ation 

In situ hybridization is a powerful and versatile technique for the detection and localization of nucleic 
30 acid sequences within cell or tissue preparations. It may be useful, for example, to identify sites of gene 
expression, analyze the tissue distribution of transcription, identify and localize viral infection, follow changes 
in specific mRNA synthesis and aid in chromosome mapping. 

In situ hybridization was performed following an optimized version of the protocol by Lu and Gillett. 
Cell Vision 1: 169-176 (1994), using PCR- generated 33 P-labeJed riboprobes. Briefly, formalin-fixed, 
35 paraffin-embedded human tissues were sectioned, deparaffinized. deproteinated in proteinase K (20 g/ml) for 
15 minutes at 37 6 C, and further processed for in situ hybridization as described by Lu and Gillett, supra. A 
[33p] UTP-labeled antisense riboprobe was generated from a PCR product and hybridized at 55 °C overnight 
The slides were dipped in Kodak NTB2 nuclear track emulsion and exposed for 4 weeks. 
33 P-Riboprobe synthesis 
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6.0 uJ (125 mCi) of 33 P-UI? (Amersham BF 1002, SA<2000 Ci/mmol) were speed vac dried. To 
each tube containing dried 33 P-UTP, die following ingredients were added: 2.0 ul 5x naisc^tion buffer: 1.0 
ul DTT(lOOmM); 2.0 ul NTP mix (2.5 mM : 10 ul; each of 10 mM GTP, CTP & ATP + 10 uJ H 2 0); 1.0 ui 
UTP (50 uM); 1.0 ul Rnasin; 1.0 pi DNA template (lug); 1.0 ul H 2 0. 
5 The tubes were incubated at 37°C for one hour. i.O uL RQ1 DNase were added, followed by 

incubation at 37*C for 15 minutes. 90 uL TE (10 mM Tris pH 7.6YlmM EDTA pH 8.0) were Hd r d . and the 
mixture was pipetted onto DE81 paper. The remaining solution was loaded in a Microcon-50 ultrafiltration 
unit, and spun using program 10 (6 mimnes). The filtration unit was inverted over a second tube and spun 
using program 2 (3 minutes). After the final recovery spin, 100 uL TE were 1 uL of the final product 

1 0 was pipetted on DE8 1 paper and counted in 6 ml of Biofluor II. 

The probe was run on a TBE/urea gel. 1-3 jxLof the probe or 5 uL of RNA Mrk HI were added to 3 
uL of loading buffer. After heating on a 95°C heat block for three minutes, the gel was immediately placed on 
ice. The wells of gel were flushed, the sample loaded, and run at 1 80-250 volts for 45 minutes The gel was 
wrapped in saran wrap and exposed to XAR film with an intensifying screen in -70°C freezer one hour to 

15 overnight 

33 P-Hybridization 

Prctrcatmcm of frozen sections The slides were removed from the freezer, placed on aluminum 
trays and thawed at room tempenuure for 5 minutes. The trays were placed in 55 'C incubator for five minutes 
to reduce condensation. The slides were fixed for 10 minutes in 4% paraformaldehyde on ice in the fume 
20 hood, and washed in 0.5 x SSC for 5 minutes, at room temperature (25 ml 20 x SSC + 975 ml SQ H 2 0). After 
deproteination in 0.5 ug/ml proteinase K for 10 minutes at 37°C (I2JuL of 10 mgmil stock in 250 ml 
prewarmcd RNase-free RNAse buffer), the sections were washed in 0 3 x SSC for 10 minutes at room 
temperature. The sections were dehydrated in 70%, 95%, 100% eihanoL 2 minutes each. 

Preireatmcm of paraffin -embedded sections The slides were deparaffinized, placed in SQ H->0. and 
25 rinsed twice in 2 x SSC at room temperature, for 5 minutes each time. The sections were deproteinated in 20 
ug/ml proteinase K (500 uL of 10 mg/mJ in 250 ml RNase-free RNase buffer. 37C, 15 minutes ) - human 
embryo, or 8 x proteinase K (100 uL in 250 ml Rnase buffer, 37°C, 30 minutes) - formalin tissues. 
Subsequent rinsing in 0.5 x SSC and dehydration were performed as described above. 

Prehybridization The slides were laid out in plastic box lined with Box buffer (4 x SSC, 50% 
30 formamide) - saturated filter paper. The tissue was covered with 50 uL of hybridization buffer (3.75 g Dextran 
Sulfate + 6 ml SQ H 2 0), vortexed and heated in the microwave for 2 minutes with the cap loosened. After 
cooling on ice, 18.75 mi formamide, 3.75 ml 20 x SSC and 9 ml SQ H 2 0 were added, the tissue was vortexed 
well, and incubated at 42 °C for 1 -4 hours. 

Hybridization 1.0 x 10 6 cp. probe and 1.0 uL RNA (50 mg/mJ stock) per slide were heated at 
35 95°C for 3 minutes. The slides were cooled on ice, and 48 uJ. hybridization buffer were added per slide. After 

vortexing. 50 uL 33 P mix were added to 50 uL prehybridization on slide. The slides were incubated overnight 
at55C. 
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Washes Washing was done 2x 1 0 minutes with 2xSSC, EDTA at room temperature (400 ml 20 x SSC 
+ 1 6 ml 0.25M EDTA, Vf=4L), followed by RNaseA treatment at 37 °C for 30 minutes (500 uL of 10 mg/ml in 

250 ml Rnase buffer - 20 ug/ml), The slides were washed 2x10 minutes with 2 x SSC, EDTA at room 
temperature. The stringency wash conditions were as follows: 2 hours at 55°C. 0.1 x SSC, EDTA (20 ml 20 x 
5 SSC + 1 6 ml EDTA, Vf=4L). 

Alternatively, multi-tissue blots containing poly A + RNA (2 ug per lane) from various human tissues 

were purchased from C Ion tech (Palo Alto, CA). DNA probes were labeled with [a--* 2 P]dCTP by random 
priming DNA labeling Beads (Pharmacia Biotech). Hybridization was performed with Expresshyb (Clontech) 
at 68 °C for 1 nr. The blots were then washed with 2X SSC/0.05% SDS solution at room temperature for 40 
10 min, followed by washes in 0. IX SSC/0. 1%SDS solution at 55°C for 40 min with one change of fresh solution. 
The blots were exposed in a phosphorimager. 

DNA 29101 fVEGFB9) I 
DNA2910I (SEQ ID NO:l) was examine in three separate in siiu studies wherein the following probes were 
15 used: 

VEGFB9-pl (SEQ ID NO: 1 94): 

5 , <}GATTCTAATACGACTCACTATAGGGCGGCGGAATCCAACCTCAGTAG-3 , 
VEGFB9-p2 (SEO ID NO: 195): 

S'-CTA TG A AAT TAA CCC TCA CTA AAG GG A GCG CCT ATC CTC CTG TGC TC-3' 

20 

IS97-029: 

Expression observed in the developing lower fetal limb bones at the edge of the cartilagenous anlage 
(/.*. around the outside edge); in developing tendons, in vascular smooth muscle and in cells embracing 
developing skeletal muscle myocytes and myotubes. Expression also observed in the following tissues: 
25 epiphyseal growth plate: lymph nodes - marginal sinus: thymus - subcapsular region of the thymic cortex, 
possibly representing either the subcapsular epithelial cells or the 

proliferating, double negative, thymocytes that are found in this region; tracheal smooth muscle: brain 
(cerebral cortex) - focal expression in cortical neurones; small intestine - smooch muscle; thyroid - thyroid 
epithelium; liver - ductal plates: stomach - mural smooth muscle: fetal skin - basal layer of squamous 
30 epithelium: placenta - interstitial cells in trophoblastic villi; spinal cord - no expression except in wall of 
arteries and veins. No expression was observed in the spleen and adrenals. 

The above expression pattern suggests that DNA29101 may be involved in cell differentiation 
/proliferation. 

35 IS97-037: 

Expression in superovulatcd rat ovaries were negative in aft* sections with both antisense and sense 
probes. Either the message is not expressed in this model, or the human probe does not cross react with rat 

IS97-087: 
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High expression levers were observed ai the following sites: chimp ovary - granulosa cells of 
manning follicles, lower intensity signal observed over thecal cells; chimp parathyroid - high expression over 
chief cells; human fetal testis - moderate expression over stromal cells surrounding developing tubules: human 
fetal lung - high expression over chondrocytes m developing bronchial tree, and low level expression over 
branching bronchial epithelium. 

Fetal tissues examined (E12-E16 weeks) include: placenta, umbilical cord, liver, kidney, adrenals, 
thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, 
spinal cord, body wall, pelvis and lower limb. Aduh tissues examined include liver, kidney, adrenal, 
myocardium, aorta, spleen, lymph node, pancreas, lung, skin, cerebral cortex (rm). hippocampus(rm). 
cerebellum(rm). penis, eye, bladder, stomach, gastric carcinoma, colon, colonic carcinoma and 
chondrosarcoma. Also examined were acetaminophen induced liver injury and hepatic cirrhosis 



DNA30871 : 

IS97-044: In fetal tissues, strong signals were observed over neurones in fetal cerebral cortex, spinal cord, 
spinal ganglia as well as enteric neurones in the wall of the fetal stomach. Signal also observed over cells 
around the root of the aorta (possibly the cowiucting system), adrenal medulla, mesenchymal cells in 
neurovascular bundle, renal parenchyma and cells lying between skeletal muscle myocytes. All other fetal 
tissues negative. 

No expression was observed in adult tissue. Fetal tissues (12-16 weeks) examined include: placenta, 
umbilical cord, liver, kidney, adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, 
spleen, thymus, pancreas, brain, eye. spinal cord, body wall, pelvis and lower limb. Adult tissues examined 
include: liver, kidney, adrenal, myocardium, aorta, spleen, lymph node, pancreas, lung and skin. 
The probes used in the above analysis were the following: 
DNA30871-p1 (SEQ ID NO: 196): 

T-GGA TTC TAA TAC G AC TCA CTA TAG GGC CTC CCG TCT CCT CCT GTC CTC-T 
DNA3087l-p2 (SEQ ID NO:197): 

S-CTA TG A AAT TAA CCC TCA CTA AAG GG A CCT CGG CAT CTT CGT CAC ATT-3* 
DNA30942 : 

DNA30942 (SEQ ID NO: 13) was examined in four separate in situ studies (including two in the diseased tissue 
study of Example 7 using the following probes: 
DNA30942-pl (SEQ ID NO: 1 98) 

5*-GGA TTC TAA TAC GAC TCA CTA TAG GGC TCG CTG CTG TGC CTC GTG TTG-3' 
DNA30942-p2: (SEQ ID NO: 199) 

5'-CTA TCA AAT TAA CCC TCA CTA AAG GGA CCG CTC C AG CCT CTT GAT GGA-3' 

IS97-043: 
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No expression was observed in fetal tissues. The fetal tissues examined included: placenta, umbilical 
cord, brain, spinal cord, eye, optic nerve, trachea, lung, heart thymus, liver, spleen, esophagus, small intestine, 
pancreas, adrenal, thyroid, body wall and lower limb. 

No expression was observed in adult tissues. The adult tissues examined included liver, kidney, adrenal, 
5 myocardium, aorta, spleen, lymph node, pancreas, lung and skin. 

DNA33087 aS97-051): 

In fetal tissue, expression of DNA33087 (SEQ ID NO: IS) was observed in osteoblasts at all sites of 
enchondral and periosteal new bone formation, the developing pulmonary arterial and aortic trunks. The fetal 
10 tissues examined included: placenta, umbilical cord, brain, spinal cord, eye, opuc nerve, trachea, lung, heart, 
thymus, liver, spleen, esophagus, small intestine, pancreas, adrenal, thyroid, body wall and lower limb. 

No expression was observed in the adult tissues examined including: Liver, kidney, adrenal, 
myocardium, aorta, spleen, lymph node, pancreas, lung and skin. 
The probable role in control of bone matrix deposition and or osteoblast growth. 
15 All adult tissues in the multtblock were positive for heta-actin. 

The probes used in this procedure were the following: 

DNA33087-pl (SEQ ID NO:200): 

f-GGA TTC TAA TAC GAC TCA CTA TAG GGC CCC GAG TGT TTT CCA AGA-3' 
20 DNA33087-p2 (SEQ IDNO:201 ): 

y-CTA TGA AAT TAA CCC TCA CTA AAG GGA C AA GTT TAC TAG CCC ATC CAT-3* 
DNA33087-p3 (SEQ ID NO:202): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC TGG ATG GGC TAG TAA ACT TGA-3* 
DNA33087-p4 (SEQ ID NO:203): 
25 .V-CTA TGA AAT TAA CCC TCA CTA AAG GGA CCC TTC TGC TCC TTC TTG TT-V 

DNA34387 {IS97-I09): 

The expression pattern of DNA34387 (SEQ ID NO:25) was observed in fetal and adult human tissues at the 
following sites: 

30 Fetal - thyroid epithelium, small intestinal epithelium, gonad, pancreatic epithelium, hepatocytes in liver and 
renal tubules. Expression also seen in vascular tissue in developing long bones. 

Adult - Moderate signal in placental cytotrophoblast, renal tubular epithelium, bladder epithelium, parathyroid 
and epithelial tumors. 

The fetal (E12-EI6 weeks) tissues examined included: placenta, umbilical cord, liver, kidney, adrenals, 
35 thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, 
spinal cord, body wall, pelvis and lower limb. 

The adult human tissues examined: kidney (normal and end-stage), adrenal, myocardium, aorta, spleen, lymph 
node, gall bladder, pancreas, lung, skin, eye (inc. retina), prostate, bladder, liver (normal cirrhotic, acute 
failure). 

40 The non-human primate tissues examined included the following: 
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Chimp tissues* Salivary gland, stomach, thyroid, parathyroid, skin, thymus, ovary, lymph node. 
Rhesus Monkey Tissues: Cerebral cortex, hippocampus, cerebellum, penis. 

The probes used in this procedure were the following: 
DNA34387-pl (SEQ ID NO:206): 
5 T-GGA TTC TAA TAC G AC TCA CTA TAG GGC CCG AGA TAT GCA CCC AAT GTC-T 
DNA34387-p2 (SEQ ID NO:207): 

5* -CTA TGA AAT TAA CCC TCA CTA AAG GG A TCC CAG AAT CCC GAA GAA CA-3' 

DNA35638 : 
10 JS97-078: 

Expression of DNA35638 (SEQ ID NO:35) was observed in the endothelium lining a subset of fetal and 
placental vessels. Endothelial expression was confined to these tissue blocks. Expression also observed over 
intermediate neophobias! cells of placenta. 

The fetal tissues examined (E12-E16 weeks) included: placenta, umbilical cord, liver, kidney, 
15 adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, 
brain, eye. spinal cord, body wall, pelvis and lower limb. 

The adult tissues examined included: liver, kidney, adrenal, myocardium, aorta, spleen, lymph node, pancreas, 
lung, skin, cerebral cortex (rm), hippocampus(rm). cerebellum(rm), penis, eye, bladder, stomach, gastric 
carcinoma, colon, colonic carcinoma, thyroid (chimp), parathyroid (chimp) 
20 ovary (chimp) and chondrosarcoma. Also examined was tissue derived from acetaminophen induced liver 
injury and hepatic cirrhosis. 

The oligos used for the above procedure were the following: 
DNA35638-pl (SEQ ID NO:208): 

5-GGA TTC TAA TAC GAC TCA CTA TAG GGC GGG AAG ATG GCG AGG AGG AG-3' 
25 DNA35638-p2 (SEQ ID NO:209): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA CCA AGG CCA CAA ACG GAA ATC-3' 

DNA39523 : 

The following probes were used in the in situ studies below: 
30 DNA39523-p 1 (SEQ ID NO:2 1 0): 

5*-GGA TTC TAA TAC GAC TCA CTA TAG GGC AGC GCA CGG CCA CAG ACA-3* 
DNA39S23-p2 (SEQ ID NO:21 1): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA GAC CCT GCG CTT CTC GTT CCA-3' 
35 198-052: 

DNA39523 (SEQ ID NO:45) in normal human skin (neonatal foreskin) and adult psoriatic; skin both 
exhibited specific strong expression in the epithelial cells of the stratum basale - (he single layer along the 
basement membrane which is the progenitor for all of the overlying epidermal cells in the skin. 

There was no expression in epidermal cells in the overlyin g layers (stratum spinosum, straum 
40 granulosum, etc.). The intensity of the signal was slightly increased in psoriatic skin. Expression was also 
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apparent in the dermis (the connective tissue immediately underlying the epidermis) of both normal and 
psoriatic skin. Expression here was most apparent in spindle shape cells within the collagen matrix - the 
stromal fibroblasts. 

In the brain, sections of cerebrum had strong specific expression in a subset of superficial cortical 
5 neurons - a distinct pattern suggestive of a specific population of cortex neurons. 

In inflamed and normal bowel: Normal human large bowel and bowel with cither Crohns's disease or 
ulcerative colitis had specific moderate to strong expression in a multifocal pattern within the lamina propria of 
villi. The cells labeled by in situ were spindloid stromal cells best delineated as 

fibroblasts. There was no expression by intestinal epithelial cells and there was no apparent increased 
10 expression (intensity or frequency) in diseased bowel. Specifically there was also no correlation of expression 
and lesions in the inflamed bowel 

In human fetal kidney, there was specific weak to moderate expression in multifocal developing 
tubules: expression was in the tubular epithelium in these foci. 

The expression of DNA39523 (SEQ ID NO:45) in the skin and specific localization to the basal 
15 epithelial cells of the epidermis cells suggests a potential role in differemiation/maintenance of the basal 
epidermal cells. This expression pattern in combination with the fact that expression occurs in cells that are 
directly adjacent to the basement lamina, suggests that the cells regulate trafficking of leukocytes into the 
epidermis. As a result DNA39523 (SEQ ID NO:45) may be a constitutively expressed signal for the 
trafficking of dendritic/Langerhan cells or lymphocytes into the epidermis. Such trafficking is a normal 
20 physiologic event that occurs in normal skin and is thought to be involved in immunosurvcil lance of the skin. 

The expression of DNA39523 (SEQ ID NO: 45) in inflammatory bowel disease was not increased 
from normal tissue, and there was no correlation of its expression to inflammatory lesions. Similarly, its 
expression in the basal epidermal cells in psoriatic skin lesions was equivalent to or only slightly greater than 
that seen in normal neonatal skin (but age-matched control adult skin was not available at the time of the 
25 study). 

IS97-i28: 

The expression of DNA39523 (SEQ ID NO:45) was observed in the epithelium of mouse embryo skin 
as well as the basal epithelium and dermis of human fetal skin. The basal epithelial pegs of the squamous 

30 mucosa of the chimp tongue are also positive. Expression was also observed in a subset of cells in developing 
glomeruli of fetal kidney, adult renal tubules, and over "thyroidized" epithelium in end-stage renal disease. 
However, low expression was also seen in a renal cell carcinoma, probably over the epithelial cells. 
Expression was also observed in the stromal cells both (1) at low levels in fetal lung, and (2) in the apical 
portion of gastric glands. High expression was indicated in the lamina propria of the fetal small intestinal villi, 

35 normal colonic mucosa and over stromal cells in a colonic carcinoma. Strong expression occurred in benign 
connective tissue cells in the hylanized stroma of a sarcoma. Expression also occurred in stromal cells in the 
placental villi and the splenic red pulp, w the brain, expression occurred in corneal neurones. 

DNA39523 (SEQ ID NO:45) was also expressed in the connective tissue surrounding developing 
bones and over nerve sheath cells in the fetus. 
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The fetal tissues examined (EJ2-EI6 weeks) included: placenta, umbilical cord liver, kidney, 
adre na l s , thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, 
brain, eye, spinal cord, body wall, pelvis and lower limb. The adult tissues examined included: liver, ki dney, 
adrenal, myocardium, aorta, spleen, lymph node, pancreas, lung, skin, cerebral cortex (rm), hippocampus(nrj), 
eye, stomach, gastric carcinoma, colon, colonic carcinoma, thyroid (chimp), parathyroid (chimp) ovary (dump) 
and chondrosarcoma. Also examined included acetaminophen induced liver injury and hepatic cirrhosis. 

IS98-092: 

The expression of DNA39523 (SEQ ID N0.45) was present in many cells in the outer layers (I and II) 
of the monkey cerebral cortex. A small subset of cells in the deeper conical layers also expressed mRNA for 
the chemokine homolog. Scattered cells within the molecular layers of the hippocampus and bordering the 
inner edge of the dentate gyrus showed expression of DNA39523 (SEQ ID NO:45). No expression was 
detected within the cerebellar cortex. Expression of DNA39523 (SEQ ID NO:45) was not observed in 
mfarcted brain, where cell death has occurred in die regions where the chemokine homolog normally is 
expressed. DNA39523 (SEQ ID NO:45) could possibly serve as a marker of a subset of neurons of outer 
layers of the cerebral cortex and could possibly reveal neuronal migration disorders. Abnormal neuronal 
migration is a possible cause of some seizure disorders and schizophrenia. 



1598- J28: 

DNA39523 (SEQ ID NO:45) showed intriguing and specific patterns of hybridization within postnatal 
day (P)I0 and adult mouse brains. In one sagittal section of P10 mouse brain, strong signal was observed 
scattered within the molecular layer of the Wppocampus and inner edges of the dentate gyms. Cells in the 
presubiculum were moderately labeled: die signal extended in a strong band through outer layers of the 
retrosplenial cones to the occipital cortex, where the signal diminished to background levels. A small set of 
positive neurons were detected in deeper regions of PIO motor cortex: neurons in outer layers of P10 cortex did 
not exhibit signal above background levels. Moderate hybridization signal was also detected in the inferior 
colliculus. Chemokine homolog signal in the adult mouse brain was evaluated in three coronal sections at 
different levels. Strong signal was detected in the septum and in scattered neurons in the pontine nuclei and 
motor root of the trigeminal nerve; moderate signal was seen in the molecular layers of the hippocampus and 
outer layers of the retrosplenial cortex. 

1599- 027: 

Bolekine (also known as BRAK - the chemokine to which DNA39523 (SEQ ID NO:45) bear 
significant homology) belongs to a chemokine subgroup characterized by a cys-x-cys (CXC) motif and absence 
of an animc-terainal glu-leu-arg (ELR). Non-ELR CXC (includingSDF-1. IP10, Mig and PF4) are 

chemotactic for subsets of leukocytes including B and T lymphocytes. They also have angjostaric activity. 

DNA39523 (SEQ ID NO:45) was detected in Postnatal day (P) 1 mouse brain, bolekine signal was 
detected in the birjpocanrpus (stratum lacunosum moleculare and hflus of the dentate gyrus) and anterior 
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olfactory nucleus, but not in the developing cerebral cortex or cerebellum. By P10, signal is present in a suoset 
of cells in layers 1 & 2 of the cerebral cortex. A small population of cells in the deeper layers also express 
DNA39523 (SEQ IDNO:45). The pattern in the hippocampus resembled the P I brain. Weak signal is present in 
the cerebellum, especially lobules IX and X. Signal is also present in the dorsal striatum and colliculi. 
5 in the adult mouse brain, bolekine-positive cells were difficult to detect in the adult cerebral cortex, 

but signal is present in the anterior olfactory nucleus and hippocampus. In ischemic mouse brains, however, 
bolekine signal is induced in the penumbra. 

In the Developing cerebral cortex, bolekine expression correlates with final stages of neuronal 
migration and the establishment of axonal projections and synaptogencsis. Other CXC chemokines have roles 
10 in neuronal migration and patterning in the central nervous system (SDF-l) and modulation of neuronal 
activity (IL-8 and GRO-a). 

Bolekine expression is induced in ischemic -leperfusion injury in the brain, but not in other 
inflammatory states. 

/ 

15 DNA47365 (IS97-1421: In fetal tissues, the expression of DNA47635 (SEQ ID NO:9l) was observed in die 

foscia lining the anterior surface of the vertebral body. There is expression over the fetal retina. Low level 

expression over fetal neurones. 

The following probes were used in the above analysis: 

DNA47365-pl (SEQ ID NO:214): 
20 S'-GGA TTC TAA TAC GAC TCA CTA TAG GGC AAC CCG AGC ATG GCA CAG CAC-3* 

DNA47365-p2 (SEQ ID NO:215): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GG A TCT CCC AGC CGC CCC TTC TC-3' 
PNA49435 (IS97-IS6): 

25 Moderate expression of DNA49435 (SEQ ID NO: 1 11) was observed over conical neurones in the 

fetal brain. Expression was also present over the inner aspect of the fetal retina, possible expression in the 
developing lens. Expression was seen over fetal skin, cartilage, small intestine, placental villi and umbilical 
cord. In adult tissues there is an extremely high level of expression over the gallbladder epithelium. Moderate 
expression of DNA49435 (SEEQ ID NO: 111) was seen over the adult kidney, gastric and colonic epithelia. f 

30 The human fetal tissues examined (E12-E16 weeks) included: placenta, umbilical cord, liver, kidney, 

adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, 
brain, eye, spinal cord, body wall pelvis, testis and lower limb. The adult human tissues examined included: 
kidney (normal and end-stage), adrenal, spleen, lymph node, pancreas, lung, eye (inc. retina), bladder, liver 
(normal, cirrhotic, acute failure). 

35 The non-human primate tissues examined included the adrenal glands from chimp tissues and the 

cerebral cortex, hippocampus and cerebellum of rhesus monkey tissues. 

The probes used in the above analysis were the following: 
DNA49435-pl (SEQ ID N0218): 

5M3GA TTC TAA TAC GAC TCA CTA TAG GGC GGA TCC TGG CCG GCC TCT G-3' 
40 DNA49435-p2 (SEQ ID NO:219): 
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y-CTA TGA AAT TAA CCC TCA CTA AAG GGA GCC CGG GCA TGG TCT CAG TTA-3' 
DNA54228 fIS98-I0S): 

Expression of DNA54228 (SEQ ID HO: 133) was observtd in bone spicules: fetal metaphyseal bone, 
5 fetal calvarimn (skull) and bone tissue in human neoplasia (osteosarcoma and chondrosarcoma). There is weak 
but consistent signal in small bone spicules in the metaphysis of fetal bone and in ossified spicules in a 
chondrosarcoma and an osteosarcoma. No signal was detected in human lung, liver, thymus, kidney, thyroid, 
brain, spleen, fetal tissues including adrenal, brain, cartilage, lung, liver, intestine, gonad, heart and skin. 
The probes used in the above procedure were the following: 
1 0 hmDETl-p 1 (SEQ ID NO:220): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC ACC ACC ACC CAG GAG C-3' 
hmDETI-p2 (SEQ IDNO:221 ): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA AAT GAA GTG GGA CGT TTG AGT-3' 
DNA54228-pl (SEQ ID NO:222): 
1 5 5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CTT CTT TCC TTC ACC ACC ACC-3* 
DNA54228-p2 (SEQ ID N0023): 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA TCT GCC TTG GCT TTT GAC AC-3' 

DNA5423HmFIZZ3) : 
20 IS98-070-. 

DNA54231 (SEQ ID NO: 139) showed a moderate signal that is specific to adipocytes. This signal 
was present in mesenteric fat and in interstitial fat in the neck around the trachea. The expression pattern 
appears to be specific for adult fat 

25 IS98-W. 

The expression of DNA5423 1 (SEQ ID NO: 1 39) was specific to adipocytes and was present wherever 
such cells were found which in this study included the peritoneal mesentery, perirenal fat in the renal pelvis, 
and the mammary fat pad. There was no expression in any other cell type in normal murine brain, liver, 
kidney, mammary gland, pancreas, spleen, pancreas, bone marrow, stomach, duodenum, jejunum, ileum, colon, 
30 cecum, testis, skin, or lung. 

The selective distribution of this molecule to adipocytes suggests a role in either fat metabolism or the 
pfoducttWgenesis of adipocytes, either of which is important in obesity. 

The probes used for the above procedure were the following: 
DNA5423I-pl (SEQ ID N0224): 
35 5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CGA GGG GGA CAG GAG CTA ATA-3' 
DNA5423 1 -p2 (SEQ ID NO:225): 

F-CTA TGA AAT TAA CCC TCA CTA AAG GGA GTC CCA CGA GCC AC A GG-3' 
DNA59294 (1S98-I38): 
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DNA 59294 (SEQ ID NO: 149) was evaluated in a panel consisting of normal adult and tetal tissues 
and tissues with inflammation, predominantly chronic lymphocytic inflammation. In summary, the expression 
was specific to muscle, certain types of smooth muscle in the adult and in skeletal and smooth muscle in the 
human fems. The expression in adult human was in smooth muscle of tubular organs evaluated including 
5 colon and gall bladder. There was no expression in the smooth muscle of vessels or bronchi. No adult human 
skeletal muscle was evaluated. In fetal tissues there was moderate to high diffuse expression in skeletal muscle 
the axial skeleton and limbs. There was weak expression in the smooth muscle of the intestinal wall but no 
expression in cardiac muscle. 

In adult tissues, the colon showed a low level of diffuse expression in the smooth muscle (tunica 
10 xnuscularis) in 5 specimens with chronic inflammatory bowel disease. In the gall bladder, there was weak to 
low level expression in the smooth muscle of the gall bladder. 

In fetal human tissues, there was moderate diffuse expression in skeletal muscle and weak to low 
expression in smooth muscle. However expression was not detected in the fetal heart or any other fetal organ 
including liver, spleen. CNS. kidney, gut. lung. 
15 The additional human tissues tested whh no delectable expression included: lung with chronic 

granulomatous inflammation and chronic bronchitis (5 patients), peripheral nerve, prostate, heart, placenta, 
liver (disease mulriblock including acctomihopin induced injury and cirrhosis), brain (cerebrum and 
cerebellum), tonsil (reactive hyperplasia), peripheral lymph node, thymus. 

The probes used in the above procedure were the following: 
20 626.pl (SEQ ID NO:226): 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CGG AAT GG A CTG GCC TCA CAA-3' 
626.p2 (SEQ ID NO:227): 

5'-CTA TG A AAT TAA CCC TCA CTA AAG GGA AGG ATG GTC TCG GGC TGC TG-3' 

25 PNA30S68 US97-Q44) 

DNA30R68 expression was found in the following fetal tissues: spinal cord, autonomic ganglia, 
enteric nerves, sacral plexus, peripheral and cranial nerves. 

"Die fetal tissues examined were the following: Placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, 
heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, spinal cord, 
30 body wall pelvis and lower limb. 

The adult tissues examined included: Liver, kidney, adrenal, myocardium, aorta, spleen, lymph node, pancreas, 
lung and skin. 

The probes used for the above procedure were the following: 
DNA30S68.pl (CI 1 1-G): (SEQ ID NO:304) 

33 5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC AGA GAC AGG GCA AGC AGA ATG-3* 
DNA30868.p2 (CI U-H): (SEQ ID NO:305) 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA G AA GGG GAT GAC TGG AGG AAC-3' 

DNA53517 : 
40 IS9W70: 
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DNA53517 (SEQ ID NO:255) expression in the norma] adult murine lung was patchy, with expression in a 
subset of mucosal epit hel ial cell in the large airway (bronchi/bronchioles) . There is also expression within the 
rare discrete cells in the submucosal intersnuum adjacent to the large airways. These cells, typically 1 -3 within 
a positive focus, are adjacent to large vessels and may repres ent smooth muscle cells, peripheral nerves or 
5 Schwann cells, or lymphatics. 

Id the murine adult lung with allergic inflammation (eosinophilic, lymphocytic vasculitis, 
bronchiolitis and pneumonitis), there was diffuse strong expression in all mucosal epithelial cells of all of the 
large airways (bronchi/bronchioles) of the lung. There was also strong expression in discrete cells that 
represent a subset of epithelial cells that line the alveoli: these cells are type 0 pneumocytes. There is also 
10 expression, as in normal lung, present within rare discrete cells in the submucosal intersdrium adjacent to the 
large airways. 

In normal adult murine small and large intestine, there is strong expression within multifocal few 
discrete single cells that are present in the submucosa. the tunica muscularis and the mesentery. The cells that 
express the signal are almost always associated with nerve, vein, artery triads within these areas. These cells 
1 5 are spindle shaped and may be either a peripheral nerves. Schwann cells 

associated with such nerves or some type of support cell associated with vessel or lymphatics. Interestingly, 
there is no expression within identifiable myenteric plexi that are present within the tunica muscularis. 

In inflamed large bowel (from an IL10R KO mouse) the pattern of expression is similar but 
expression level is significantly decreased. 

20 

IS98-093: 

The distribution of DNA53715 (SEQ ID NO:255) was further evaluated in a broad screen of normal 
murine tissues. In normal lung, expression is variable but when present was restricted to murine bronchial 
epithelial cells and type n alveolar cells in the lung. There is a marked increase in expression in these cells in 

25 inflamed lung (allergic inflammation with bronchial mucosal hypertrophy/hyperplasia; asthma model). The 
expression of DNA53715 (SEQ ID NO:255) in the bowel is most prominent in the colon and is present in few 
discrete cells within the submucosa and mucosa muscularis. the thin, well vascularized tissue layer between the 
muscle wall of the bowel and the mucosa proper. The exact identity of these cells has not been delineated, 
however, their spindloid morphology and close association to capillaries and small vessels in the submucosa 

30 suggest the following possibilities: a subset of vascular pericytes or non-myelinated nerve fibers. 

The expression of DNA53715 (SEQ ID NO:255) in discrete cells in the bowel submucosa was 
restricted to die colon and was not seen in sections of jejunum, ileum, proximal duodenum or stomach. 
Expression was not detected in the following normal murine tissues: liver, kidney, spleen, bone marrow, lung, 
pancreas, stomach, proximal duodenum, jejunum, ileum, brain, skin, testis, or mammary glands 

35 It is possible that DNA53715 (SEQ ID NO:255) has a role in erihancing or simulating mucosal 

immunity in the lung. 

The probes used for the above procedure were the following: 

DNA53517.pl (C301-P): (SEQ m N0 J0 8) 

5*-GGA TTC TAA TAC GAC TCA CTA TAG GGC CCC AGO ATG CCA ACT TTG A-3 f 
40 DNA5351 7.p2 (C301-Q): (SEQ N O309) 
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5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA AGG AGG CCC ATC TGTTCA TAG-3' 
EXAMPLE 7 

In siiu Hybridization in Cells and Diseased Tissues 
5 The in situ hybridization method of Example 6 is used to determine gene expression, analyze the 

tissue distribution of transcription, and follow changes in specific xnRNA synthesis for the genes/DNAs and the 
proteins of the invention in diseased tissues isolated from human individuals suffering from a specific disease. 
These results show more specifically where in diseased tissues the genes of the invention are expressed and are 
more predictive of the particular localization of the therapeutic effect of the inhibitory or stimulatory 
10 compounds of the invention (and agonists or antagonists thereof) in a disease. Hybridization is performed 
according to the method of Example 6 using one or more of the following tissue and cell samples: 

(a) lymphocytes and antigen presenting cells (dendritic cells. Langherhans cells, macrophages and 
monocytes. NK cells); 

(b) lymphoid tissues: normal and reactive lymph node, thymus. Bronchial Associated Lymphoid 
1 5 Tissues. (BALT). Mucosal Associated Lymphoid Tissues (MALT); 

(c) human disease tissues: 

• Synovium and joint of patients with Arthritis and Degenerative Joint Disease: 

• Colon from patients with Inflammatory Bowel Disease including Ulcerative Colitis and 
Crohns' disease: 

20 • Skin lesions from Psoriasis and other forms of dermatitis; 

• Lung tissue including BALT and tissue lymph nodes from chronic and acute bronchitis, 
pneumonia, pneumonitis, pleuritis: 

• Lung tissue including BALT and tissue lymph nodes from Asthma; 

• nasal and sinus tissue from patients with rhinitis or sinusitis; 

25 • Brain and Spinal cord from Multiple Sclerosis. Alzheimer's Disease and Stroke: 

• Kidney from Nephritis. Glomerulonephritis and Systemic Lupus Erythcmatosis: 

• Liver from Infectious and non-infectious Hepatitis and acetaroinophen-induccd liver cirrhosis; 

• Tissues from Neoplasms/Cancer. 

30 Expression is observed in one or more cell or tissue samples indicating localization of the therapeutic 

effect of the compounds of the invention (and agonists or antagonists thereof) in the disease associated with the 
cell or tissue sample. 

The sequences of the oligonucleotides used, where expression overlaps with the non-diseased tissue 
distribution reported earlier is recited in Example 6. 

35 

DNA30942 : 

IS98-02J : Expression was observed in mononuclear phagocytes in the normal chimp thymus, as well as in a 
gastric carcinoma (l/l) colorectal cancer (1/1), breast cancer (2/5) and a lung cancer (1/4). Expressed by 
malignant cells in an osteosarcoma and a poorly differentiated liposarcoma. Possible signal in the m a l i g n a nt 
40 cells of a testicular teratoma and breast cancers (1/5). In one of the lung cancers scattered signal is seen over a 

146 

SUBSTITUTE SHEET (RULE 26) 

(C) 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/USOG/05841 



10 



high endothelial venule within pulmonary lymphoid tissue. The fetal tissues examined (E12-E16 weeks) 
wduded: placenta, umbuicaJ cord, liver, kidney, adrenals, thyroid, lungs, heart great vessels, esophagus, 
stomach* small intestine, spleen, thymus, pancreas, brain, eye, spinal cord, body wall pelvis and lower limb. 
The adult human tissues examined included: liver, kidney, adrenal, myocardium, aorta, spleen, lung. skin, 
chondrosarcoma, eye, stomach, gastric carcinoma, colon, colonic carcinoma, renal cell carcinoma, prostate, 
bladder mucosa and gall Madder. Also examined was tissue derived from acetaminophen induced liver injury 
and hepatic cirrhosis. The rhesus tissues examined include: cerebral cortex (rm), hippocampus(rm>. The 
chimp tissues examined included: thyroid, parathyroid, ovary, nerve, tongue, thymus, adrenal, gastric mucosa 
and salivary gland. 

IS98-085: Expression was observed in eight adenocarcinomas and seven squamous lung carcinomas. Actins 
were strongly positive in all tumors, indicating that all are suitable for in sim hybridization analysis. 
Expression of DNA30942 was observed in 6 of the rumors as follows: 
6727-95 / squamous carcinoma - Strongly expressed over neoplastic epithelium: 
9558-95 / squamous carcinoma - Expression over neoplastic epithelium: 
1 5 1 2235-95 / adenocarcinoma - Expression over in situ and infiltrating rumor cells: 

6545-95 & 4187-96 / squamous carcinomas - Expression over cells in tumor stroma, no expression seen over 
tumor cells: 

12954-94 / squamous carcinoma - possible weak expression over stromal cells. 
20 IS99-U2: 

The in situ expression of DNA30942 (SEQ ID NO:13) was evaluated numerous chronic inflammatory 
conditions and lymphoid organs. In summary, DN A3 0942 (SEQ ID NO: 13) was strongly expressed in high 
endothelial venules (HEV) in the tonsil hilar lymph node, bronchial nnicosal-assodated lymphoid tissue 
(BALT) in chronic asthma, patchy expression in colonic mucosa and weak inconsistent expression in gut- 
25 mucosal associated lymphoid tissues (GALT) HEV. 

In lymphoid tissues, there was observed strong specific expression in single sections of tonsil hilar 
lymph node, bronchial mucosal-associated lymphoid tissue BALT) in a case of chrome asthma, and in gut 
mucosal associated lymphoid tissues in sections of IBD (GALT/MALT). In each of these lymphoid organs 
expression specifically was present in faigh-endotheliai venules (HEV). 

In tissue in a chronic asthmatic lung, additionally to expression in BALT HEVs, specific expression 
was observed in small capillaries lined with high or reactive swollen endothelial cells in the submucosa of 
inflamed bronchi This region was not intimately associated with BALT but was specific to the submucosal 
site for mftamwny cell trafficking to the bronchi. There was a significant submucosal infiltrate of 
eosinophils in these areas. In other sections of diseased lung (COPD and chronic interstitial pneumonia) there 
35 was not any expression of DN A3 0942 (SEQ ID NO: 13), these sections had some artifact (loss of tissue from 
slide). 

to psoriatic tisinie, there capillaries m r^riatic plaques. In 

tonsilar tissue, additional to expression in HEVs associated with follicles, mere was also strong expression 
whJmt me reticulated toijsi^ 



30 
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capillaries present Expression was also within some of the epithelial cells. This is an important 
immunological site and is involved with antigen presentation and may play a role in tolerance induction. 

In tissue isolated from patients suffering from CrohmV Disease and ulcerative colitis, colons! 
expression was present in the mucosa with patchy distribution in some but not in all cases. Expression in HEV 
5 in GALT was present as a significantly weaker signal than seen in other lymphoid tissues and was not 
consistently present even in sections where there was strong but patchy expression in the mucosa. 

In tissue isolated from acetaminophen induced liver injury and cirrhosis, there was weak expression in 
small capillaries within areas in the portal tracts with chronic lymphocytic inflammation. 

10 DNA33460 (IS98-01S): 

The expression of DN A3 3460 (SEQ ID NO:20) was observed over cells in loose connective tissue 
immediately adjacent to developing extra ocular muscle in the fetal eye. Moderate expression over soft-tissue 
sarcoma. The fetal tissues examined (E12-E16 weeks) included: placenta, umbilical cord, liver, kidney, 
adrenals, thyroid, lungs, heart, great vessels, esophagus, stomach, small intestine, spleen, thymus, pancreas. 

15 brain, eye. spina) cord, body walL pelvis and lower limb. The aduh tissues examined included the liver, 
kidney, renal cell carcinoma, adrenal, aorta, spleen, lymph node, pancreas, lung, myocardium, skin, cerebral 
cortex (rm), hippocampus (mi), cerebellum (rm), bladder, prostate, stomach, gastric carcinoma, colon, colonic 
carcinoma, thyroid (chimp), parathyroid (chimp) ovary (chimp) and chondrosarcoma. Also examined was 
tissue extracted from acetaminophen induced liver injury and hepatic cirrhosis. 

20 The probes used in mis procedure were the following: 

DNA33460.pl (SEQ ID NO:204): 

5 , -CGA TTC TAA TAC GAC TCA CTA TAG GGC CAG CAC TGC CGG GAT GTC AAC-3* . 
DNA33460-p2 (SEQ ID NO;205): 

y-CTA TGA AAT TAA CCC TCA CTA AAG GGA GTT TGG GCC TCG GAG CAC TG-3* 

25 

DNA34387 OS98-08S): 

Expression observed in lung cancer tumors and was positive in all eight squamous carcinomas and in 
678 adenocarcinomas. Expression levels are low to moderate in the adenocarcinomas and very strong in the 
squamous carcinomas. No expression was seen in the tumor stroma, alveoli or normal respiratory epithelium. 
30 Possible low level expression in the lymph nodes. 

Expression was observed in lung cancer. The gene was amplified in Taqman analysis of a lung tumor 
panel. Expression was observed in eight squamous carcinomas and in 678 adenocarcinomas. Expression was 
seen in in situ and infiltrating components. Expression levels were low to moderate in the adenocarcinomas. In 
general expression was higher in the squamous carcinomas and in two the expression was strong. Possible tow 
35 level expression in lymph nodes. 

DNA35638 : 
IS98-124: 

This study examined the expression of DNA35638 (SEQ ID NO 35) in inflamed human tissues 
40 (psoriasis, IBD, inflamed kidney, inflamed lung, hepatitis (liver block), normal tonsil, aduh and chimp 
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multiblocks) DNA35638 (SEQ ID NO:35) has been shown elsewhere in this application to have 
immuno s n r n u l arory ( enhanc e s T lymphocyte proliferation in (be MLR and costimuiation) and proinflammatory 
properties (induces a neutrophil infiltrate in vivo). 

This study evaluated the differential expression of this molecule in vessels of mflawm j human tissues 
as co mp a r ed to non- inflamed tiss u es In summary, expression was present in the endothelium/intima of large 
vessels in the lung afflicted with chronic inflammation, in the superficial dermal vessels of the psoriatic skin, in 
arterioles in a specimen of chronic sclerosing nephritis, and in cap illari es including the perifoUucular sinuses of 
tonsil DNA35638 (SEQ ID NO:35) was not expressed (as detectable by mis methodology) in normal skin 
(human foreskin specimens), norma! lung, inflamed (8 LBD specimens) or normal large bowel, chronically 
inflamed or cirrhotic liver, normal adult cardiac tissue, or adrenal gland. 

DNA39523 : 
198-052: 

DNA39523 (SEQ ID NO:45) in norma) human skin (neonatal foreskin) and adult psoriatic skin both 
exhibited specific strong expression in the epithelial cells of the stratum basale - the single layer along the 
basement membrane which is the progenitor for all of the overlying epidermal cells in the skin. 

There was no expression in epidermal cells in the overlying layers (stratum spinosum. straum 
granuiosum. etc.). The intensity of the signal was slightly increased in psoriatic skin. Expression was also 
apparent in the dermis (the connective tissue immediately underlying the epidermis) of both normal and 
psoriatic skin. Expression here was most apparent in spindle shape cells within the collagen matrix - the 
stromal fibroblasts. 

In inflamed and normal bowel: Normal human large bowel and bowel with either Crohns's disease or 
ulcerative colitis had specific moderate to strong expression in a multifocal partem within the lamina propria of 
villi. The cells labeled by in situ were spindloid stromal cells best delineated as 

fibroblasts. There was no expression by intestinal epithelial cells and there was no apparent increased 
expression (intensity or frequency) in diseased bowel. Specifically there was also no correlation of expression 
and lesions in the inflamed bowel. 

The expression of DNA39523 (SEQ ID NO:45) in the skin and specific localization to the basal 
epithelial cells of the epidermis cells suggests a potential role in differenuation/mamtenance of the basal 
epidermal cells. This expression panem in combination with the fact mat expression occurs in cells that are 
directly adjacent to the basement lamina, suggests that the cells regulate trafficking of leukocytes into the 
epidermis. As a result DNA39523 (SEQ ID NO:45) may be a constitutive^ expressed signal for the 
trafficking of dendritic/Langerfcan cells or lymphocytes into the epidermis. Such trafficking is a normal 
physiologic eveni that occurs in normal skin and is thought to be involved in immunosurveillance of the skin. 

The expression of DNA39523 (SEQ ID NO:45) in inflammatory bowel disease was not increased 
from normal tissue, and there was no correlation of its expression to inflammatory lesions. Similarly, its 
expression in the basal epidermal cells in psoriatic skin lesions was equivalent to or only slightly greater than 
that seen in normal neonatal skin (but age-matched control adult skin was not available at the time of the 
study). 
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DNA45416 (IS9B-140): 

The expression of DNA45416 (SEQ ID NO:79) was evaluated in a variety of human and non-human 
primate tissues and was found to be highly specific. Expression was present only in alveolar macrophages in 
the lung and in Kupffer cells of the hepatic sinusoids. Expression in these cells was significantly increased 
5 when these distinct cell populations were activated. Though these two subpopulanons of tissue macrophages 
are located in different organs, they have similar biological functions. Both types of these phagocytes act as 
biological filters to remove material from the blood stream or airways including pathogens, senescent cells and 
proteins and both are capable of secreting a wide variety of important proinflammatory cytokines. 

In inflamed lung (7 patient samples) expression was prominent in reactive alveolar macrophage cell 

10 populations defined as large, pale often vacuolated cells present singly or in aggregates within alveoli and was 
weak to negative in normal, non-reactive macrophages (single scattered cells of normal size). Expression in 
alveolar macrophages was increased during inflammation when these cells were both increased in numbers and 
size (activated). Despite the presence of histocytes in areas of interstitial inflammation and peribronchial 
lymphoid hyperplasia in these tissues, expression was restricted to alveolar macrophages. Many of the 

15 inflamed lungs also had some degree of suppurative inflammation; expression was not present in neutrophilic 
granulocytes. 

In liver, there was strong expression in reactive/activated Kupffer cells in livers with acute 
centrilobular necrosis (acetaminophen toxicity) or fairly marked periportal inflammation. However there was 
weak or no expression in Kupffer cells in normal liver or in liver with only mild mflammauon or mild to 
20 moderate lobular hyperp lasia/h ypcrtrophy . Thus, as in the lung, there was increased expression in 
actvated/reacuve cells. 

There was no expression of mis molecule in histiocytes/macrophages present in inflamed bowel, 
hyperplastic/reacnve tonsil or normal lymph node. The lack of expression in these tissues which all contained 
histiocytic inflammation or resident macrophage populations strongly supports restricted expression to the 
25 unique macrophage subset populations defined as alveolar macrophage and hepatic Kupffer cells. However, 
the expression of DNA454216 (SEQ ID NO:79) spleen or bone marrow was riot available for evaluation. 

Human tissues evaluated which had no detectable expression included: Inflammatory Bowel disease 
(7 patient samples with moderate to severe disease), tonsil with reactive hyperplasia, peripheral lymph node, 
psoriatic skin (2 patient samples with mild to moderate disease), heart peripheral nerve. Chimp tissues 
30 evaluated which had no detectable expression included: tongue, stomach, thymus. 

The probes used for the above studies were the following: 
628.pl (SEQ ID NO:2I2): 

5'-GGA TTC TAA TAC G AC TCA CTA TAG GGC CTC CAA GCC CAC AGT G AC AA-3' 
62S.p2(SEQIDNO:213): 
35 5-CTA TG A AAT TAA CCC TCA CTA AAG GG A CCT CCA CAT TTC CTG CCA GTA-3' 

DNA41374 : 
IS-9M77: 

DNA41 374 (SEQ ID NO:248) was expressed in thymic T lymphocytes 
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Summary: in numerous tissues evaluated there expression was only detected as weak diffiise expression in 
thymic T lymphocytes. The limited distribution pattern suggests expression by T lymphocytes or cells closely 
associated with T lymphocytes such as antigen presenting cells (dendritic cell popiuanbns, etc). In inflamed 
human tissue with significant lymphocytic inOammadon and presence of reactive follicle formation 
(inflammatory bowel disease and chronic iymphocync 

interstitial pneumonia/hronchitis) there was no detectable expression in areas which contained significant 
numbers of T lymphocytes. The tissues tested for which there was no detectable expression included; human 
normal tissues: placenta, lung, spleen, adrenal gland, skin kidney, eye, liver, 

human diseased tissue: liver disease: chronic hepatitis, chronic cholangitis, acute centrilobular necrosis 
(acetaminophen toxicity); Neoplasia (tumor mulublock): osteosarcoma, squamous cell carcinoma: human fetal 
tissues: brain, spinal cord, lung, heart kidney, axial and limb musculoskrieton vessels, umbilical cord: non- 
human primate: tongue, thyroid gland, parathyroid gland, stomach, salivary gland. 

1S9&I25. 

DNA4I374 (SEQ ID NO:248> has low level expression in non-human primate thymus and in human 
tonsil in T lynipnocyie specific regions. Tlic limned distribution pattern suggests expression by T lymphocytes 
or cells closely associaied with T lymphocytes such as antigen presenting ceils (dendritic cell populauons. e/c). 
In inflamed ussue with significant lymphocytic mflanunation and presence of reactive follicle formation 
(mflamraatory bowel disease and chronic lymphocytic interstitial pneumonia/bronchitis) there was no 
detectable expression in areas which likely contain significant numbers of T lynrphocytes. 

I n flamed lun S : (chronic lymphocytic and granulomatous pneumonitis): weak to negative signal in the 
interBtiirum compared to the control sense probe. There was weak expression in normal chimp thymus (human 
thymus not available) and in human tonsil. In the latter the expression was predominantly in T lymphocyte 
areas of this structure including the perifollicular marginal zone and in the paracortex. 

There was no detectable expression in ihc following human tissues: mflammatory bowel disease (8 
patient specimens), chronically inflamed and normal hmg (6 patient specimens), chronic sclerosing nephritis 
(I ). chronically and acutely inflamed and cirrhotic liver (10 specimen multiblock), normal and psoriatic skin. 
p erip h eral lymph node (non-reactive). 
The probes used for the above procedures were the following: 

4l374.pl (C337-G): " (SEQ ID NO J06) 

5'-GGA TTC TAA TAC GAC TCA CTA TAG GGC CTC CAC AGA ACC TCG CCA TCA-3' 
41374.p2 (C337-H): (SEQ ^ UQM7) 

5'-CTA TGA AAT TAA CCC TCA CTA AAG GGA TGG GGC AAG ACT CAC AAG CAG-T 

DNA53517 : 
1S98-070: 

DNA53517 (SEQ ID N0255) expression in the normal adult murine lung was patchy, with 
expression in a subset of mucosal epithelial cell in the large airway (bronchi/bronc^^ There is also 
expression within the rare discrete cells in the submucosal intersnrium adjacent to the large airways. These 
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cells, typically 1-3 within a positive focus, are adjacent to large vessels and may represent smooth muscle cells, 
peripheral nerves or Schwann cells, or lymphatics. 

In the murine adult lung with allergic inflammation (eosinophilic, lymphocytic vasculitis,, 
bronchiolitis and pneumonitis), there was diffuse strong expression in all mucosal epithelial cells of all of the 
5 large airways (bronchi/bronchioles) of the lung. There was also strong expression in discrete cells that 
represent a subset of epithelial cells thai line the alveoli; these cells are type II pneumocytes. There is also 
expression, as in normal lung, present within rare discrete cells in the submucosal intersmium adjacent to the 
large airways. 

In normal adult murine small and large intestine, there is strong expression within multifocal few 
10 discrete single cells that are present in the submucosa. the tunica muscularis and the mesentery. The cells that 
express the signal are almost always associated with nerve, vein, artery triads within these areas. These cells 
are spindle shaped and may be either a peripheral nerves. Schwann cells 

associated with such nerves or some type of support cell associated with vessel or lymphatics. Interestingly, 
there is no expression within identifiable myenteric plexi that arc present within the tunica muscularis. 
15 In inflamed large bowel (from an IL10R KO mouse) the partem of expression is similar but 

expression level is significantly decreased. 



IS98-US: 

DNA53715 (SEQ ID NO:255, mouse FIZZ- 1) was used as a detection probe in the following human 
20 tissues: gastric carcinoma, inflamed lung (3 patients) (vessels, alveoli, large airways and mucous glands), 
aorta, heart, placenta and gall bladder. 

Expression of mouse DNA53715 (SEQ ID N0.255) was present in normal mouse lung in large airway 
epithelium and had marked increased expression in inflamed murine lung (airway epithelium, type II alveolar 
pneumocytes). It was also expressed in discrete cells in the submucosa of the large bowel along vascular 
25 channels. 



DNA84210 : 

The following probes were used in the in situ studies indicated below: 
842 10.p 1 (F-7961 9): (SEQ ID NO:3 1 0) 

30 5-GGA TTC TAA TAC GAC TCA CTA TAG GGC GCG GTC GCA GGA CAT TCA GTA-3' 

842 I0.p2 (F-79620): (SEQ ID NO:3 1 1 ) 

5*-CTA TG A AAT TAA CCC TCA CTA AAG GGA ACT CTT TGG GTT CCA GCA CAC-3' 

DNA84210 (SEQ ID NO:285) is expressed in fetal kidney, primarily in developing glomeruli and 
tubules of the cortical zone and also weakly in fetal lung and spinal cord. There is also expression in stromal 

35 cells adjacent to developing cartilage and bone. In adult tissues, weak expression is seen in normal bronchial 
epithelium, in one (adenocarcinoma) of five lung tumors (2 squamous and 3 adenocarcinomas) and in a 
chondrosarcoma. There is possibly expression in the skin and its appendages, however, die section is folded 
and difficult to evaluate. 



40 IS99-102: 
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Expression of DNA842I0 (SEQ ID NO:285) in malignant melanoma, lung mmor. colon tumor, cell 
pellet mouse tissues, fetal tissues. 

Expression of DNA84210 (SEQ ID NO:285) is seen in several adult (neoplastic and non-neoplasric) 
and fetal tissues. As far as normal adult tissues are concerned DNA842I0 (SEQ ID NO:285) is seen in the 
epidermis of skin (mostly in nasally located cells) and in skin appendages, such as hair follicles and sebaceous 
glands associated with them. Expression is also seen in bronchial epithelium and submucosal bronchial glands 
In human fetal tissues, expression of DNA84210 (SEQ ID NO:285) is seen in skin and skin appendages, lung, 
renal cortex and pancreatic ducts. It is also seen in mesenchymal cells adjacent to developing bone and 
cartilage. There is no hybridization signal seen in mouse embryos. Expression of DNA84210 (SEQ ID 
NO:2S5) is seen in one of six colorectal adenocarcinomas (weak). 2 of 3 lung Bdenocarcinomas (one shows 
strong, but very focal expression, one is very weakly positive). 0 of 3 lung squamous ceil carcinomas and 1 of 
I chondrosarcomas (weak). Expression is also seen in 5 of 5 malignant melanomas, the intensity of expression 
ranges from very weak to strong. These sections also demonstrate expression of DNA84210 (SEQ ID NO:285) 
in normal epidermis and skin appendages. 

EXAMPLE 8 
Use of the PRO polypeptides as a hybridization probe 
The following method describes use or a nucleotide sequence encoding the PRO polypeptides as a 
hybridization probe. 

DNA comprising the coding sequence of full-length or mature PRO polypeptides is employed as a 
probe to screen for homologous DNAs (such as those encoding naturally-occurring variants) in human tissue 
cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing cither library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled PRO - derived probe (e.g. PRO200. PRO204. 
PR0212. PR0216. PR0226. PRO240. PR0235. PR0245. PR0172. PR0273. PR0272. PR0332. PR0526. 
PRO70I. PR036I. PR0362. PR0363. PR0364. PR0356. PR0531. PR0533. PR01083. PR0865. PRO770. 
PR0769. PR0788. PR01II4. PRO1007. PROU84. PROI031. PROI346. PR01155. PRO1250. PROI3I2. 
PROI 192. PROJ246. PROI2S3. PROI 195. PR01343. PR01418. PR01387. PROI410. PR019I7. PR01868. 
PRO205. PR02I. PR0269. PR0344. PR0333. PR038K PRO720. PR0866. PRO840. PR0982. PR0836. 
PROI 159. PR01358. PR01325. PR01338. PR01434. PR04333. PRO4302, PRO4430 or PR05727) to the 
filters is performed in a solution of 50% formamide. 5x SSC, 0.1% SDS. 0.1% sodium pyrophosphate. 50 mM 
sodium phosphate. pH 6.8. 2x Denhardfs solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of 
the filters is performed in an aqueous solution of 0. lx SSC and 0. 1% SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO 
polypeptide can then be identified using standard tecimiques known in the an. 



EXAMPLE 9 
Expression of the P RO polypeptide in E. coli 
example illustrates prerjaration of an nnglycosylated form of the PRO polypeptides by 
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recombinant expression in £ coli. 

The DNA sequence encoding the PRO polypeptide is initially amplified using selected PCR primers. 
The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites on the 
selected expression vector. A variety of expression vectors may be employed. An example of a suitable vector 
is pBR322 (derived from E. coli: see Bolivar a aL Gene. 2:95 (1977)) which contains genes for ampicilJin and 
tetracycline resistance. The vector is digested with restriction enzyme and dcphosphorylatcdL The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which 
encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STO codons, 
poryhis sequence, and enterokinase cleavage site), the PRO polypeptide coding region, lambda transcriptional 
terminator, and an argU gene. 

The ligation mixture is then used to transform a selected £. coli strain using the methods described in 
Sambroox at a/., supra. Transformanrs are identified by their ability to grow on LB plates and antibiotic 
resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and 
DNA sequencing. 

Selected clones can be grown overnight in liquid culrure medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells arc 
then grown to a desired optical density, during which the expression promoter is turned on. 

After culmrine the cells for several more hours, the cells can be harvested by ccntrirugatiun. The cell 
pellet obtained by the centrirugation can be solubiiized using various agents known in the art. and the 
solubilized PRO polypeptide protein can then be purified using a metal chelating column under conditions thai 
allow tight binding of the protein. 

The PRO polypeptides may also be expressed in £. coli in a poly-His tagged form, using the following 
procedure. The DNA encoding a PRO polypeptide is initially amplified using selected PCR primers. The 
primers contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector, and other useful sequences providing for efficient and reliable translation initiation, rapid 
purification on a metal chelation column, and proteolytic removal with enterokinase. The PCR-amplificd. 
poly-His tagged sequences are then ligated into an expression vector, which ts used to transform an E. coli host 
based on strain 52 CW3I 10 ruhAftonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transfonnants are first grown in 
LB containing 50 mg/mi carbenicillin at 30'C with shaking until an O.D.600 of 3-5 is reached. Cultures are 
then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium 
<mtate-2H20. 1.07 g KCl 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 
1 10 mM MPOS, pH 7.3. 0.55% (w/v) glucose and 7 mM MgS0 4 ) and grown for aijproximatcly 20-30 hours at 
30*C with shaking. Samples are removed to verify expression by SDS-PAGE analysis, and the bulk culture is 
centrifuged to pellet the cells. Cell pellets are frozen until purification and refolding. 

E. coli paste from 0.5 to I L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 
guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetmtmonate is a dded to make final 
concentrations of 0.1M and 0.02 M. respectively, and the solution is stirred overnight at 4'C. This step results 
in a denatured protein with all cysteine residues blocked by sulfitolizarion. The solution is centrifuged at 
40,000 rpro in a Beckman Iflmicenrifuge for 30 mis. The supernatant is diluted with 3-5 volumes of metal 
chelate column buffer (6 M guanidine. 20 mM Tris. P H 7.4) and filtered through 0.22 micron filters to clarify. 
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Depending on condition the clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column 
cqnflibrated in the metal chelate cohinm buffer. The column is washed with additional buffer containing 50 
mM imidazole (Calbiochtm Utrol grade), pH 7.4. The protein is eluted with buffer containing 250 mM 
imidazole. Fractions containing the desired protein was pooled and stored at 4'C. Protein concentration is 
estimated by its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid 
sequence. 

The proteins are refolded by diluting sample slowly into freshly prepared refolding buffer consisting of: 
20mMTris.pH8.6. 0.3 M NaCl, 2.5 M urea. 5 mM cysteine. 20 mM glycine and I mM EDTA. Refolding 
volumes are chosen so that the final protein concentration is between 50 to 100 imcrograrns/ml. The refolding 
solution is stirred gently at 4»C for 12-36 hours. The refolding reaction is quenched by the addition of TFA to 
a final concentration of 0.4% (pH of approximately 3). Before further purification of me protein, the solution 
is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The refolded 
protein is chromatography on a Poros RI/H reversed phase column using a mobile buffer of 0.1% TFA with 
elution with a gradient of acctonitrile from 10 10 80%. Aliquots of fractions with A280 absorbance are 
analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein arc pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 
since those species are the most compact with their hydrophobic interiors shielded from mtcraction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfoldcd forms of proteins from the desired form, the reversed phase step also removes endotoxin 
from the samples. 

Fractions containing the desired folded PRO polypeptide proteins are pooled and the acetonitrile 
removed using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, 
pH 6.8 with 0.14 M sodium chloride and 4% matmitol by dialysis or by gel filtration using G25 Superfine 
(Pharmacia I resins equilibrated in the formulation buffer and sterile filtered. 

EXAMPLE 10 
Expression of the PRO nolv neptides in mammalian cells 
This example illustrates preparation of a potentially glycosylated form of the PRO polypeptide in 
recombinant expression in mammalian cells. 

The vector. pRK5 (see EP 307.247. published March 15. 1989). is employed as the expression vector. 
Optionally, the PRO DNA is ligated into pRJC5 with selected restriction enzymes to allow insertion of the 
respective PRO DNA using ligation methods such as described in Sambiook a aL sugra. The resulting vector 
is called, for example, pRXi-PRO. 

In one embodiment the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 
grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 pg of the pRK5-PRO DNA is mixed with about 1 
Ug DNA encoding the VARNA gene fThiinmappaya a aL, Cell, 31 :543 (1982)] and dissolved in 500 uL of 1 
mM Tris-HCL 0.1 mM EDTA, 0.227 M CaCl,. To this mixture is added, dropwise, 500 pL of 50 mM HEPES 
(pH 7J5). 280 mM NaCL 1.5 mM NaP0 4 , and a precipitate i, allowed to form for 10 minutes at 25°C. The 
precipitate is suspended and added to the 293 cells and allowed m settle for about four hours at 37»C. The 
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cufture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfecbons, the culture medium is removed and replaced with 
culture medium (alone) or culture medium containing 200 uCi/ml "S-cysteine and 200 uCi/ml M S-methionine. 
5 After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto 
a 15% SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of the polypeptide of the invention polypeptide. The cultures containing transfected cells may 
undergo further incubation (in serum free medium) and the medium is tested in selected bioassays. 

in an alternative technique, pRJC5-PRO may be introduced into 293 cells transiently using the dextran 

10 sulfate method described by Somparyrac ct al.. Proc Natl, Acad. ScU U:7575 (198 1). 293 cells are grown to 
maximal density in a spinner flask and 700 ug pRK5«PRO is added The cells are first concentrated from the 
spinner flask by centrifugation and washed with PBS. The DNA-ccxiran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 ug/ml bovine insulin and 

15 0.1 ug'ml bovine transferrin. After about lour days, the conditioned media is centriruged jnd filtered to 
remove cells and debris. The sample containing the expressed polypeptide of the invenuon can then be 
concentrated and purified by any selected method, such as dialysis and'br column chromatography. 

In another embodiment, the polypeptides of the invention can be expressed in CHO cells. The pRK5- 
PRO can be transfected into CHO cells using known reagents such as CaPO« or DEAE-dextran. As described 

20 above, the cell cultures can be incubated, and the medium replaced with culture medium (alone) or medium 
containing a radiolabel such as "S-methionine. After determining the presence of a polypeptide of the 
invention polypeptide, the culture medium may be replaced with scrum free medium. Preferably, the cultures 
are incubated for about 6 days, and then the conditioned medium is harvested. The medium containing the 
expressed polypeptide of the invention can then be concentrated and purified by any selected method. 

25 Epitope-taggcd polypeptide of the invention may also be expressed in host CHO cells. The DNA 

encoding the desired polypeptide of ihc invention may be subcloned out of the pRK5 vector. The subclone 
insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly-his tag into a Baculovtrus 
expression vector. The poly-his tagged polypeptide of the invention insert can then be subcloned into a SV40 
driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO 

30 cells can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as 
described above, to verify expression. The culture medium containing the gjtprfsy^ poIy-His tagged 
polypeptide of the invention can then be concentrated and purified by any selected method, such as by Ni 2 *- 
chelate affinity chromatography. 
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EXAMPLE 11 
Expression of PRO in Yeast 
The following method describes recombinant expression of PRO in yeast. 

First yeast expression vectors are constructed for intracellular production or secretion of the PRO 
5 polypeptide from the ADH2A3APDH promoter. DNA encoding a polypeptide of the invention and the 
promoter is inserted into suitable restriction enzyme sites in me selected plasmid to direct intracellular 
expression of the PRO. For secretion. DNA encoding the PRO can be cloned into the selected plasmid, 
together with DNA encoding the ADH2/GAPDH promoter, a native sequence PRO signal peptide or other 
mammalian signal peptide, or. for example, a yeast alpha-factor or invenase secretory signal/leader sequence, 

10 and linker sequences (if needed) for expression of the polypeptide of the invention. 

Yeast cells, such as yeast strain AB110. can then be transformed with the expression plasmids 
described above and cultured in selected fermentation media. The transformed yeast supematams can be 
analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of 
the gels with Coomassie Blue stain. 

15 Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 

fermentation medium by ccntnftigaiion and then concentrating the medium using selected cartridge filters. The 
concentrate containing the polypeptide of the invention may further be purified using selected column 
chromatography resins. 



20 EXAMPLE 12 

Expression of PR O in Bacnlovirus-lnfected Insect Cells 
The following method describes recombinant expression of PRO in Baculovirus-infccted insect cells. 
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and irnmunoglobulin tags (like Fc regions of IgG). 
25 A variety of plasmids may be employed, including plasmids derived from commercially available plasmids 
such as pVL1393 (Novagen). Briefly, the sequence encoding a polypeptide of the invention or the desired 
portion of the coding sequence of the DNA encoding a PRO polypeptide (such as the sequence encoding the 
extracellular domain of a transmembrane protein or the sequence encoding the mature protein if the protein is 
extracellular] is amplified by PCR with primers complementary to the 5' and 3' regions. Tne S primer may 
incorporate flanking (selected) restriction enzyme sites. The product is then digested with those selected 
restriction enzymes and subcloned into the expression vector. 

Recombinant baculovirus is generated by cc-transfecring me above plasmid and BaculoGold™ vims 
DNA (Pharmingen) into Spodoptera Jntgipcrda CSf9") cells (ATCC CRL 1711) using lipofemn 
(cornmercially available from GIBCO-BRL). After 4 . 5 days of incubation at 28°C, the released viruses are 
harvested and used for further amplifications. Viral infection and protein expression are performed as 
described by OReilley et at.. Baculovirus expression vectors: A Laboratory Manual. Oxford: Oxford 
University Press (1994). 

Expressed poly-his tagged polypeptide of the invention can then be purified, for example, by Ni**- 
chelate atTiniry chromatography as follows. Extracts are prepared torn recorribinant viru^mfected SS> cells as 
40 described by Rupert et aL Nature. 362:175-179 (1993). Briefly, Sf9 ceils are washed, suspended in 



30 



35 



157 



SUBSTITUTE SHEET (RULE 26) 



(0 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/US00/05841 



sonication buffer (25 mL Hepes, pH 7.9: 12.5 mM MgCU; 0. 1 mM EDTA; 10% glycerol: 0.1% NP-40: 0.4 M 
K.C1), and sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugatiork and the 
supernatant is diluted 50-fold in loading buffer (50 mM phosphate, 300 mM NaCL 10% glycerol. pH 7.8) and 
filtered through a 0.45 urn filter. A Ni 2 *-NTA agarose column (commercially available from Qiagen) is 
5 prepared with a bed volume of 5 mL. washed with 25 mL of water and equilibrated with 25 mL of loading 
buffer. The filtered cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to 
baseline A 2 so with loading buffer, at which point fraction collection is started. Next, the column is washed with 
a secondary wash buffer (50 mM phosphate; 300 mM Nad 10% glycerol. pH 6.0). which elutes 
nonspccifically bound protein. After reaching A M0 baseline again, the column is developed with a 0 to 500 
10 mM Imidazole gradient in the secondary wash buffer. One mL fractions are collected and analyzed by SDS- 
PAGE and silver staining or Western blot with Nr"-NTA-conjugated to alkaline phosphatase OQiagen). 
Fractions containing the eluted HLH, r tagged- polypeptide of the invention are pooled and dialyzed against 
loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) PRO polypeptide can be performed using 
1 5 known chromatography techniques, including for instance. Protein A or protein G column chromatography. 

EXAMPLE 13 
Preparation of Antibodies that Bind PRO 
This example illustrates preparation of monoclonal antibodies which can specifically bind the 
20 polypeptides of the invention. 

Techniques for producing me monoclonal antibodies arc known in the art and are described, for 
instance, in Coding, supra. Immunogens that may be employed include the purified polypeptide of the 
invenuon itself, fusion proteins containing the respective polypeptide of the invention, and cells 
expressing recombinant polypeptide of the invention on the cell surface. Selection of the hnmunogen can be 
25 made by the skilled artisan without undue experimentation. 

Mice, such as Balb/c. arc immunized with the polypeptide of the invention tmmunogen emulsified in 
complete Fremiti's adjuvant and injected subcutaneous! y or intraperitoneally in an amount from 1-100 
micrograms. Alternatively, the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical 
Research. Hamilton. MT) and injected into the animal's hind foot pads. The immunized mice are then boosted 
30 10 to 12 days later with additional immunogen emulsified in the selected adjuvant Thereafter, for several 
weeks, the mice may also be boosted with additional immunization injections. Serum samples may be 
periodically obtained from the mice by retro-orbital bleeding for testing in ELISA assays to detect antibodies 
specific to the respective polypeptide of the invention. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
35 with a final intravenous injection of the respective polypeptide of the invention. Three to four days later, the 
mice are sacrificed and the spleen cells are harvested. The spleen cells are then fused (using 35% polyethylene 
glycol) to a selected murine myeloma cell line such as P3X63AgU.I, available from ATCC, No. CRL 1597. 
The fusions generate hybridoma cells which can then be plated in 96 well tissue culture plates containing HAT 
(hypoxanthine. aminopterin* and thymidine) medium to inhibit proliferation of non-fused cells, myeloma 
40 hybrids, and spleen ceil hybrids. 
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The hybridoma ceils are screened in an EUSA for reactivity against the respective polypeptide of the 
invention. Determination of "positive" hybridoma cells secreting the desired monoclonal antibodies against the 
polypeptides of the invention is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balh/c mice to produce 
5 ascites containing the am>PRO200, antt-PRO204. ann.PRQ212, ami-PR0216. anti-PR0226, ami-PRO240, 
anri-PR0235. anti-PR0245. anu-PR0172, anti-PR0273. anti-PR0272. anti-PR0332, anri-FR0526, anti- 
PRO701, anti-PR0361. anti-PR0362. anti-PR0363. anri-PR0364, anu-PR0356. anti-PR053U ami-PR0533, 
ann-PROl083. anti-PR0865, anu-PRO770. anti-PR0769. anti-PR0788. anri-PR01114. anri-PRO1007, anri- 
PROII84, anti-PRO1031, anti-PROI346, anu-PROU55, anti-PROl250. anri-PROl312, anti-PROl 192, and- 

10 PR01246. ami-PROl283. anti-PRO! 195. anti-PR01343, ami-PR014I8 t anti-PRO!387. anti-PRO1410, anu- 
PR01917, anb-PR01868. anii-PRO205. ami-PR021. anti-PR0269, ano>PR0344. anti-PR0333. anu-PR0381, 
anti-PRO720, anti-PR0866. anu-PRO840. ami-PR0982. ami-PR0836, anti-PROl 159, ami-PROl358, anu- 
PR01325, anti-PR01338, ann- PRO 1434, ami-PR04333, anti-PRO4302. anu-PRO4430 or anti-PR05727 
monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue culture flasks or roller 

15 botties. Purification of the monoclonal antibodies produced in the ascites can be accomplished using 
ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, affinity 
chromatography based upon binding of antibody to protein A or protein G can be employed. 

Deposit of Material 

20 The following materials have been deposited with the American Type Culture Collection. 10801 

University Blvd. Manassas. VA 20 11 0-2209, USA (ATCC): 





Material 


UNO 


PRO 


ATCC # 


ATCC Deposit Date 




DNA29I0I-I276 


174 


200 


209653 


March 5. 1998 


25 


DNA3087 1-1157 


178 


204 


209380 


October 16. 1997 




DNA30942-M34 


186 


212 


209254 


September 16. 1997 




DNA33087-U58 


190 


216 


209381 


October 16. 1997 




DNA33460-1166 


200 


226 


209376 


October 16, 1997 




DNA34387-1 133 


214 


240 


209260 


September 16. 1997 


30 


DNA35558-M67 


209 


235 


209374 


October 16. 1997 




DNA35638-1 141 


219 


245 


209265 


September 16, 1997 




DNA35916-U61 


146 


172 


209419 


October 28, 1997 




DNA39523-1 192 


240 


273 


209424 


October 3 1,1997 




DNA40620-1183 


239 


272 


209388 


October 17, 1997 


35 


DNA40982-1235 


293 


332 


209433 


November 17, 1997 




DNA44I84-1319 


330 


526 


209704 


March 26, 1998 




DNA44205-1285 


365 


701 


209720 


Moth 31, 1998 




DNA454I0-1250 


316 


361 


209621 


Februarys, 1998 




DNA45416-I251 


317 


362 


209620 


February 5. 1998 


40 


DNA45419-1252 


318 


363 


209616 


February 5, 1998 
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DNA47365-1206 


319 


364 


209436 


November 7.1 997 




DNA47470-1130 


313 


356 


209422 


October 28, 1997 




DNA483 14-1320 


332 


531 


209702 


March 26. 1998 




DNA49435-1219 


334 


533 


209480 


November 2 1,1997 


5 


DNA5092M458 


540 


1083 


209859 


May 12. 1998 




DNA53974-1401 


434 


865 


209774 


April 14. 1998 




DNA54228-1366 


408 


770 


209801 


April 23. 1998 




DNA5423 1-1366 


407 


769 


209802 


April 23. 1998 




DNA56405-1357 


430 


788 


209849 


May 6, 1998 


10 


DNA57033-14O3 


557 


1114 


209905 


May 27. 1998 




DNA57690-1374 


491 


1007 


209950 


June 9, 1998 




DNA59220-1514 


598 


1184 


209962 


June 9. 1998 




DNA59294-1381 


516 


1031 


209866 


May 14. 1998 




DNA59776-1600 


701 


1346 


203128 


August 18. 1998 


15 


DNA59849-1504 


585 


1155 


209986 


June 16, 1998 




DNA60775-1532 


633 


1250 


203173 


September 1. 1998 




DNA61873-1574 


678 


1312 


203132 


August 18. 1998 




DNA628U-1521 


606 


1192 


203093 


August 4. 1998 




DNA64885-1529 


630 


1246 


203457 


November 3. 1998 


20 


DNA65404-155I 


653 


1283 


203244 


September 9. 1998 




DNA654I2-1523 


608 


1195 


203094 


August 4. 1998 




DNA66675-1587 


698 


1343 


203282 


September 22. 1998 




DNA68864-1629 


732 


1418 


203276 


September 22. 1998 




DNA68872-1620 


722 


1387 


203160 


August 25. 1998 


25 


DNA68874-1622 


728 


1410 


203277 


September 22. 1998 




DNA76400-2528 


900 


1917 


203573 


January 12. 1999 




DNA77624-2515 


859 


1868 


203553 


December 22. 1998 




DNA30868-H56 


179 


205 





March 2, 2000 




DNA36638-1056 


21 


21 


209456 


November 12. 1997 


30 


DNA38260-1180 


236 


269 


209397 


October 17. 1997 




DNA40592-1242 


303 


344 


209492 


November 2i, 1997 




DNA41374-1312 


294 


333 









DNA44194-1317 


322 


381 


209808 


ApriJ28. 1998 




DNA53517-1366 


388 


720 


209802 


April 23. 1998 


35 


DNA53971-1359 


435 


866 


209750 


April 7. 1998 




DNA53987-1438 


433 


840 


209858 


May 12, 1998 




DNA5770O-1408 


483 


982 


203583 


January 12, 1999 




DNA59620-I463 


545 


836 


209989 


June 16, 1998 




0NA6O627-15O8 


589 


1159 


203092 


August 4. 1998 


40 


DNA64890-1612 


707 


1358 


203131 


August 18, 1998 
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DNA66659-1593 


685 


1325 


203269 


September 22. 1998 


DNA66667-1596 


693 


1338 


203267 


September 22, 1998 


DNA688 18-2536 


739 


1434 


203657 


February 9, 1999 


DNA84210-2576 


1888 


4333 


203818 


March 2, 1999 


DNA922 18-2554 


1866 


4302 


203834 


March 9. 1999 


DNA96878-2626 


1947 


4430 


23-PTA 


May 5, 1999 


DNA98853-1739 


2448 


5727 


203906 


April 6, 1999 



These deposits was made under the provisions of the Budapest Treaty on the International 

10 Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations 
thereunder (Budapest Treaty). This assures mainienance of a viable culture of the deposit for 30 years from the 
date of deposit. The deposit will be made available by ATCC under the terms of the Budapest Treaty, and 
subject to an agreement between Genentech, Inc. and ATCC. which assures permanent and unrestricted 
availabiliry of the progeny of the culture of the deposit to the public upon issuance of the pertinent U.S. patent 

15 or upon laying open to the public of any U.S. or foreign patent application, whichever comes first, and assures 
availabiliry of the progeny to one determined by the U.S. Commissioner of Patents and Trademarks to be 
entitled thereto according to 35 USC 122 and the Commissioner's rules pursuant thereto (including 37 CFR 
1.14 with particular reference to 886 OC 638). 

The assignee of the present application has agreed that if a culture of trie materials on deposit should 

20 die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced 
on notification with another of the same. Availabiliry of the deposited material is not to be construed as a 
license to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the an to 

25 practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are runctionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereofl nor is it to be construed as [uniting the scope of the 

30 claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and fell within the scope of the appended claims. 
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What is claimed: 

1. A composition useful for the treatment of immune related diseases, comprising a PRO200. 
PRO204, PR0212. PR0216, PR0226. PRO240. PR0235. PR0245. PR0172, PR0273, PR0272, PR0332, 
PR0526, PRO701, PR036I, PR0362, PR0363, PR0364. PR0356, PR0531. PR0533, PRO1083. PR0865, 
PRO770, PR0769. PR0788, PROl 114. PRO1007, PROl 184, PRO1031, PR01346, PROl 155. PRO1250. 
PR01312, PR01192, PR01246, PROI283. PROl 195. PR01343, PR01418. PR01387, PRO1410, PR01917. 
PROI868. PRO205. PR021. PR0269. PR0344. PR0333. PR0381, PRO720, PR0866, PRO840. PR0982. 
PR0836. PROl 159. PR01358. PROI325, PR01338. PR01434, PR04333. PRO4302, PRO4430 or PR05727 
polypeptide, agonist or fragment thereof and a earner or excipiem. having the properties of: 

(a) increasing infiltration of inflammatory cells into a tissue of a mammal in need thereof, 

(b) stimulating or enhancing an immune response in a mammal in need thereof, or 

(c) increasing the proliferation of T- lymphocytes in a mammal in need thereof in response to an 
antigen. 

2. , The composition of claim I comprising an effective amount of a PRO200. PR0204. 
PR0212. PR0216. PR0226. PRO240. PR0235. PR0245. PRO 1 72. PR0273. PR0272. PR0332. PR0526. 
PRO701. PR0361. PR0362. PR0363. PR0364. PR0356. PR053L PR0533. PRO1083. PR0865. PRO770. 
PR0769. PR0788. PROH14. PROI007. PROUS4. PRO103L PR01346. PR01I55, PRO1250. PR01312, 
PROl 192. PR01246. PROI283. PROl 195. PR01343. PROI4I8, PR01387. PROI410. PR01917, PR01868. 
PRO205. PR02I, PR0269. PR0344. PR0333. PR0381, PRO720, PR0866. PRO840. PR0982. PR0836. 
PROl 159. PR01358. PR01325, PR01338. PROK34, PR04333, PRO4302. PRO4430 or PR05727 
polypeptide, agonist, antagonist or fragment thereof. 

3. The composition of claim 2 further comprising a growth inhibitory agent, cytotoxic agent or 
chemotherapeuiic agent 

4. Use of a PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235. PR0245. 
PR0172. PR0273. PR0272. PR0332, PR0526. PRO701. PR0361, PR0362. PR0363, PR0364. PR0356, 
PR0531. PR0533, PRO1083. PR0865. PRO770. PR0769. PR0788. PROl 114, PRO1007. PROl 184. 
PRO103I. PR01346, PROl 155. PRO1250. PR01312, PROl 192, PR01246, PR01283. PROl 195. PR01343, 
PR01418, PR01387. PRO1410. PR019I7, PR01868, PRO205, PR021. PR0269, PR0344, PR0333. 
PR0381, PRO720, PR0866. PRO840. PR0982. PR0836, PROl 159, PR01358, PRO 1325. PR01338. 
PR01434. PR04333, PRO4302. PRO4430 or PR05727 polypeptide, agonist or a fragment thereof to prepare 
a composition having the properties of: 

(a) increasing infiltration of inflammatory cells into a tissue of a mammal in need thereof, 

(b) stimulating or enhancing an immune response in a mammal in need thereof, or 

(c) increasing the proliferation of T-lymphocytes in a mammal in need thereof in response to an 

5. The use of claims 4 comprising an effective amount of a PRO200, PRO204, PR02I2, 
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PR0216. PR0226. PRO240, PR0235, PR0245. PR0172, PR0273. PR0272. PR0332. PR0526. PRO701, 
PR0361, PR0362, PR0363, PR0364, PR0356, PROS31. PR0533, PRO1083, PR0865. PRO770. PR0769, 
PR0788. PROII14, PRO1007. PROII84. PRO1031. PR01346, PROU55. PRO1250, PR01312, PR01192, 
PR01246, PR01283, PR01I95. PR01343, PROI418, PR01387, PRO14I0, PR01917, PR01868, PRO205. 
5 PR021. PR0269. PR0344. PR0333, PR0381, PRO720, PR0866. PRO840. PR0982. PR0836, PROl 159. 
PROI358. PROI325. PR01338, PROI434. PR04333, PRO4302, PRO4430, PR05727 polypeptide, agonist, 
antagonist or fragment thereof! 

6. The composition of claim 2 further comprising a growth inhibitory agent, cytotoxic agent or 
10 chemotherapeutic agent. 

7. A method of treating an immune related disorder, such as a T cell mediated d isomer, in a 
mammal in need thereof, comprising administering to me mammal an effective amount of a PRO200. PRO204. 
PR0212. PR0216, PR0226. PRO240. PR0235, PR0245, PROI72. PR0273, PR0272, PR0332. PROS26. 

15 PRO701. PR0361. PR0362. PR0363. PR0364. PR0356. PR0531. PROS33. PRO1083. PR0865. PRO770. 

PR0769. PR0788. PR011I4. PRO1007. PROU84. PRO103I. PR01346, PR01155. PRO1250. PR01312. 

PROl 192. PR01246. PROI283. PROl 195. PR01343. PROI418. PR013B7. PRO1410, PROI9I7. PROI868. 

PRO205. PR021. PR0269. PR0344, PR0333, PR038I. PRO720. PR0866. PRO840. PR0982. PR0836, 

PROU59. PR01358. PR01325. PR01338, PROI434, PR04333, PRO4302, PRO4430 or PR05727 
20 polypeptide, an agonist antibody thereof, on antagonist antibody thereto, or a fragment thereof. 

8. The method of claim 7. wherein the disorder is selected from systemic lupus erythematosis, 
rheumatoid arthritis, osteoarthritis, juvenile chronic arthritis, spondyloarthropathies, systemic sclerosis, 
idiopathic inflammatory myopathies. Sjogren's syndrome, systemic vasculitis* sarcoidosis, autoimmune 

25 hemolytic anemia, autoimmune thrombocytopenia, thyroiditis, diabetes mellitus. immune-mediated renal 
disease, dcmyelinating diseases of the central and peripheral nervous systems such as multiple sclerosis, 
idiopathic dcmyelinating polyneuropathy or Guillain-Barre syndrome, and chronic inflaminatory 
dcmyelinating polyneuropathy, hepatobiliary diseases such as infectious, autoimmune chronic active hepatitis, 
primary biliary cirrhosis, granulomatous hepatitis, and sclerosing cholangitis, inflammatory bowel disease, 

30 gluten-sensitive enteropathy, and Whipple's disease, autoimmune or immune-mediated skin diseases including 
bullous skin d is eases, erythema multiforme and contact de nnanos. psoriasis, allergic diseases such as asthma, 
allergic rhin i ti s, atopic der m a tit is, food hypersensitivity and urticaria, immunologic disrasc? of the hmg such as 
eosinophilic pneumonias, idiopathic pulmonary fibrosis and hypersensitivity pneumonitis, transplantation 
associated diseases including graft rejection and graft -versus-host-disease. 

35 

9- The composition or use of any of the preceding claims, wherein the agonist or antagonist is a 
monoclonal antibody. 

10. The composition or use of any of the preceding n»iiwg wherein the agonist or antagonist is 
40 an antibody fragment or a single-chain antibody. 
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11. The composition or use of claims 9 or 10, wherein the antibody has nonhuman 
complementarity determining region (CDR) residues and human framework region (FR) residues. 

5 12. A method for detennining the presence of a PRO200, PRO204, PR0212. PR0216, PR0226, 

PRO240. PR0235. PR0245, PR0172, PR0273. PR0272, PR0332. PR0526. PRO70K PR0361. PR0362, 
PR0363. PR0364, PR0356, PR0531. PR0533, PRO1083. PR0865. PRO770, PR0769, PR0788, PROl 114, 
PROI007. PR01184, PRO1031. PR01346, PROl 155. PRO1250. PROI312, PROl 192. PR01246, PR01283, 
PROl 195, PR01343, PR01418, PR01387, PRO14I0. PR01917, PR01868. PRO205. PR021, PR0269, 

10 PR0344. PR0333. PR0381. PRO720, PR0866, PRO840, PR0982. PR0836. PROl 159, PR01358, PR01325, 
PR01338. PR01434, PR04333, PRO4302, PRO4430, PR05727 polypeptide, comprising exposing a cell 
suspected of containing the polypeptide to an ami- PRO200, anu-PRO204 % anu-PR0212, anti-PR0216, ami- 
PR0226. anti-PRO240, anti-PR0235, ami-PR0245. anti-PR0172, antt-PR0273, ami-PR0272, ano-PR0332, 
anri-PR0526. anti-PRO701. anti-PR0361, anu-PR0362. anti-PR0363. anti-PR0364. ami-PR0356, anti- 

15 PR053I. ami-PR0533. anti-PRO1083. anti-PR0865. ami-PRO770. anu-PR0769. anti-PR0788, ami- 
PROllU. ami-PRO1007. ami-PROU84. ami-PRO103I. anti-PRO ! 346. ami-PROH55. anti-PROl250, ami- 
PR01312. anti-PROH92. ami-PR01246. ami-PROI283, anti-PROU95. ami-PRO!343. anti-PRO!418. anti- 
PR01387. ami-PROl4IO, anti-PR019l7, anti-PROl868. ami-PRO205, ami-PR021. ami-PR0269. anti- 
PR0344. ami-PR0333. anu-PR0381, ann-PRO720, anti-PR0866. anti-PRO840. anti-PR0982. anti-PR0836, 

20 ami-PROl 159. ami-PR01358. ami-PR01325, anti-PR01338. anti-PR01434, anti-PR04333. anti-PRO4302, 
anti-PRO4430, anti-PR05727 antibody, respeenvdy, and determining binding of the antibody to the cell. 

13. A method of diagnosing an immune related disease in a mammal, comprising detecting the 
level of expression of a gene encoding a PRO200, PRO204. PR0212, PR02I6, PR0226. PRO240. PR0235, 

25 PR0245. PR0172, PR0273. PR0272. PR0332. PR0526. PRO701. PR0361. PR0362, PR0363. PR0364, 
PR0356. PR0531. PR0533. PROI083. PR0865. PRO770. PR0769. PR0788. PR01114. PRO1007. 
PR01184. PRO1031. PR01346. PROU55. PRO1250. PR01312. PROH92. PR01246. PR012S3. PROU95, 
PR01343. PROM 18, PR01387, PRO1410, PR01917. PROI868, PRO205, PR021, PR0269, PR0344, 
PR0333, PR0381, PRO720. PR0866. PRO840. PR0982, PR0836. PROl 159. PR01358, PR01325. 

30 PRO 1338. PRO 1434, PR04333, PRO4302. PRO4430 or PR05727 polypeptide (a) in a test sample of tissue 
cells obtained from the mammal, and (b) in a control sample of known normal tissue cells of the same cell 
type, wherein a higher or lower expression level in the test sample as compared to the control indicates the 
presence of immune related disease in the mammal from which the test tissue cells were obtained. 

35 14. A method of diagnosing an immune related disease in a mammal comprising (a) contacting 

an anti-PRO200. anti-PRO204, anti-PR0212, anti-PR0216. anti-PR0226. anti-PRO240. anti-PR0235, am> 
PR0245, anti-PROl 72, anti-PR0273, ann-PR0272, am>PR0332, anri-PR0526, ann-FRO701, anti-PR0361, 
anti-PR0362, anti-PR0363, anti-PR0364, anri-PR0356 f anti-PR0531, ami-PR0533, anu-PROl083, anti- 
PR0865, anti-PRO770, antt-PR0769, anti-PR0788, anti-PROl 114, anti-PRO1007. anti-PROl 184, anti- 

40 PRO103 1, anti-PR0l346, anti-PROl 155, anti-PRO!250, ami-PR013 12, anri-PROl 192, anri-PR01246, axrt> 
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PR01283. anti-PROl 195. anti-PR01343. anti-PR01418. anti-PR01387, anti-PR014lO. anti-PR019l7. and- 
PROI868. anti-PRO205, anri-PR021, anu-PR0269. anu-PR0344, anti-PR0333. anti-PR0381. anti-PRO720, 
anti-PR0866, anri-PRO840. ami-PR0982. aau-PR0836. anri-PROI 159. anri-PR01358, anti-PROl 325, and- 
PR01338, anu-PR01434, ami-PR04333. anti-PRO4302. anti-PRO4430 or anti-PR05727 antibody wich a test 
5 sample of tissue ceils obtained from the mammal, and (b) detecting the formation of a complex between the 
antibody and the polypeptide in the test sample. 

15. An immune related disease diagnostic kit comprising an anu-PRO200, anu-PRO204. anti- 
PR0212. anti-PR0216. anti-PR0226. anti-PRO240, ami-PR0235. anti-PR0245, ami-PR0172. anri-PR0273. 

10 anri-PR0272. ami-PR0332, anti-PR0526, anu-PRO701. anti-PR0361. anti-PR0362. am>PR0363, ami- 
PR0364, anti-PR0356. anti-PR0531. anti-PR0533. anri-PRO1083. anti-PR0865. anti-PRO770. anu-PR0769, 
ami.PR0788. anti-PROl 114, anti-PRO1007. anu-PROI 184. anti-PROl03l. anti-PRO!346. anti-PROl 155, 
anri-PRO1250. anti-PR01312. anri-PROU92, anti-PROl 246. anti-PR01283. antt-PROM95, ami-PR01343, 
anti-PR01418. ami-PR0I387. ami-PROI410. anu-PROl9I7. anti-PROl868, anri-PRO205. ami-PR02l. anti- 

1 5 PR0269. ami-PR0344. anti-PR0333. anii-PR038 1 . anti-PR0720. anti-PR0866. anti-PRO840. anti-PR0982. 
ami-PR0836. anti-PROl 159. anu-PR0!358. anti-PROl325. ann-PR01338. ami-PROl434. anu-PR04333. 
anti-PRO4302. anti-PRO4430 or anti-PR05727 antibody or fragment thereof and a carrier in suitable 
packaging. 

20 16. The kit of claim 15. further comprising instructions for using the antibody to detect a 

PRO200. PRO204. PR0212. PR0216. PR0226. PRO240. PR0235, PR0245. PR0172.PR0273. PR0272, 
PR0332. PR0526. PRO701. PR0361. PR0362. PR0363. PR0364. PR0356, PR0531. PR0533. PRO1083, 
PR0865. PRO770. PR0769. PR0788, PR0III4. PRO1007. PROH84. PRO1031. PR01346. PROU55, 
PRO1250. PR01312. PR01192. PR01246. PR01283. PRO! 195. PROI343. PR014I8. PR01387. PRO14I0, 

25 PR01917. PR01868. PRO205. PR021. PR0269. PR0344. PR0333. PR0381. PRO720. PR0R66. PRO840. 
PR0982. PR0836. PROU59. PR01358. PROI325. PR0133S, PRO 1434. PR04333. PRO4302. PRO4430 or 
PR05727 polypeptide. 

1 7. An article of manufacture, comprising: 
30 a container. 

an instruction on the container; and 

a composition comprising an active agent contained within the container, wherein the composition is 
effective for inhibiting or reducing an immune response in a mammal the instruction on the container indicates 
that the composition can be used for treating an immune related disease, and the active agent in the 

35 composition is an agem inhibiting the expression and/or activity of a PRO200. PRO204. PR0212, PR021 6. 
PR0226. PRO240. PR0235. PR0245. PR0172, PR0273, PR0272, PR0332, PR0526, PRO701, FR0361, 
PR0362, PR0363, PR0364. PR0356, PR0531, PR0533, PRO1083. PR0865, PRO770, PR0769, PR0788, 
PROUI4. PRO1007. PR0I184, PRO1031 f PRO 1346. PRO II 55, PRO1250, PR01312, PROH92, PR01246, 
PROI283, PR01I95. PR01343, PR01418, PR01387, PRO1410, PR01917, FR01868, PRO205, PR021, 

40 PR0269. PR0344, PR0333. PR038 1, PRO720, PRO866, PRO840, PR0982. PR0836, PROl 159, PR01358. 
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PR01325. PR01338. PRO 1434, PR04333. PRO4302, PRO4430 or PR05727 polypeptide 

IB. The article of manufacture of claim 17 wherein said active agent is an anti-PRO200, anri- 
PRO204. ami-PR02I2, anti-PR0216, anti-PR0226, ami-PRO240, ami-PR0235. ann-PR0245 ? anti-PROI72, 
5 anti-PR0273, anti-PR0272, ami-PR0332, anti-PR0526, ami-PRO701, anti-PR036K anri-PR0362, anri- 
PR0363, anu-PR0364, anri-PR0356, anti-PR0531, anu-PR0533, anti-PRO1083, anti-PR0865, anti-PRO770, 
anti-PR0769, anti-PR0788. anti-PROl 1 14, anti-PRO1007, anri-PR01184, anti-PROl03l. anti-PR01346, 
ami-PROU55, ami-PRO1250. ami-PR013i2, anti-PROl 192, anti-PR01246. anii-PR01283, ami-PROII95, 
ami-PROl343. ami-PR01418. anti-PR01387. anti-PROl4lO, anti-PR019l7. anti«PR01868, ami-PRO205, 
10 anti-PR021, anti-PR0269, anti-PR0344. anti-PR0333, anu-PR0381. anti-PRO720, ami-PR0866. anti- 
PRO840, anu-PR0982, anri-PR0836, ami-PROU59, ami-PROl358. anti-PROI325. ami-PR01338, antt- 
PROI434. anti-PR04333. anti-PRO4302, ami-PRO4430 or ami-PR05727 antibody. 

f: 
t 

19. A method for identifying a compound capable of inhibiting the expression or activity of a 
15 PRO200. PRO204. PR02I2, PR0216. PR0226. PRO240. PR0235. PR0245. PR0172. PR0273. PR0272. 

PR0332. PR0526. PRO701. PR036I, PR0362. PR0363. PR0364. PR0356. PR0531, PR0533. PRO1083, 
PR0865. PRO770, PR0769. PR07R8, PROM 14. PRO1007. PR01184. PRO1031, PROI346. PROU55. 
PRO1250. PROI312. PROU92. PR01246. PR01283, PRO 1 195, PR01343. PR01418. PR01387. PRO14I0. 
PR01917. PROI868. PRO205. PR021, PR0269, PR0344, PR0333, PR038L PRO720. PR0866. PRO840, 
20 PR0982. PR0836. PROl 159, PRO 1358, PR01325. PR01338, PR01434, PR04333. PRO4302, PRO4430 or 
PR05727 polypeptide, comprising contacting a candidate compound with the polypeptide under conditions and 
for a time sufficient to allow these two components to interact. 

20. The method of claim 19. wherein the candidate compound or the PRO200. PRO204. 
25 PR0212. PR0216. PR0226. PRO240, PR0235. PR0245. PR0172. PR0273. PR0272. PR0332. PR0526, 

PRO701. PR0361. PR0362. PR0363. PR0364. PR0356. PR053L PROS33. PRO1083. PR0865. PRO770, 
PR0769. PR0788. PRO! 1 14. PRO1007. PR01184. PRO103I. PR01346. PR01155. PRO1250. PR01312, 
PROl 192. PR01246, PR01283, PR01I95. PR01343, PR01418. PR01387. PRO1410. PR01917, PR01868, ^" 
PRO205. PR021. PR0269, PR0344, PR0333. PR0381, PRO720, PR0866, PR0840. PR0982, PR0836, 
30 PROl 159. PR01358. PR01325, PR01338. PR01434, PR04333, PRO4302, PRO4430 or PR05727 
polypeptide is immobilized on a solid support. 

21. The method of claim 20. wherein the non-immobilized component carries a detectable label. 

35 22. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 

sequence that encodes an amino acid sequence selected from the group consisting of the amino acid sequence 
shown in Figure 1 (SEQ ID NO:l)» Figure 3 (SEQ ID NO: 1 1), Figure 5 (SEQ ID NO: 13), Figure 7 (SEQ ID 
NO: 18), Figure 9 (SEQ ID NO:20), Figure 1 1 (SEQ ID NO:25), Figure 13 (SEQ ID NO:30), Figure 15 (SEQ 
ID NO:35), Figure L7 (SEQ ID NO:40), Figure 19 (SEQ ID NO:45), Figure 21 (SEQ ID NO:50), Figure 23 

40 (SEQ ID NO:56), Figure 25 (SEQ ID NO:61 ). Figure 27 (SEQ ID NO:66), Figure 29 (SEQ ID NO:71 ), Figure 
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31 (SEQ ID NO:79), Figure 33 (SEQ ID NO:86), Figure 35 (SEQ ID N0:91). Figure 37 (SEQ ID NO:I01), 
Figure 39 (SEQ ID NO:I06), Figure 41 (SEQ ID N0:111), Figure 43 (SEQ ID N0:116X Figure 45 (SEQ ID 
NOI23). Figure 47 (SEQ ID NOU33). Figure 49 (SEQ ID N0139), Figure 51 (SEQ ID NO:l41), Figure 53 
(SEQ ID NO:143), Figure 55 (SEQ ID NO-.145). Figure 57 (SEQ ID NO:I47). Figure 59 (SEQ ID NO:149). 
Figure 61 (SEQ ID NOM51), Figure 63 (SEQ ID NOI56), Figure 65 (SEQ ID NO:158), Figure 67 (SEQ ID 
NO:160), Figure 69 (SEQ ID NOM62), Figure 71 (SEQ ID N0167). Figure 73 (SEQ ID NO.169). Figure 75 
(SEQ ID NO:177), Figure 77 (SEQ ID NO:179). Figure 79 (SEQ ID NO-.184). Figure 81 (SEQ ID NO:186), 
Figure 83 (SEQ ID NO:188). Figure 85 (SEQ ID NO:190). Figure 87 (SEQ ID NO:192), Figure 89 (SEQ ID 
N0228). Figure 91 (SEQ ID NO:230), Figure 93 (SEQ ID NO:232). Figure 95 (SEQ ID NO:240), Figure 97 
(SEQ ID NO:248). Figure 99 (SEQ ID NO:250), Figure 101 (SEQ ID NO.255). Figure 103 (SEQ ID NO:257), 
Figure 105 (SEQ ID NO:266), Figure 107 (SEQ ID NO:268). Figure 109 (SEQ ID NO:270). Figure 1 1 1 (SEQ 
ID NO:272). Figure 113 (SEQ ID NO-.274). Figure 115 (SEQ ID NO:276). Figure 117 (SEQ ID NO:278), 
Figure 1 19 (SEQ ID NO:280>, Figure 121 (SEQ ID N0485). Figure 123 (SEQ ID NO:292), Figure 125 (SEQ 
ID NO:294) or Figure 127 (SEQ ID NO.296. 



23. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 
sequence selected from the group consisting of the nucleotide sequence shown in Figure 1 (SEQ ID NO: I). 
Figure 3 (SEQ ID NO: II), Figure 5 (SEQ ID NO:l3). Figure 7 (SEQ ID NO:I8). Figure 9 (SEQ ID NO:20), 
Figure 1 1 (SEQ ID NO-.25). Figure 13 (SEQ ID NOJ0), Figure 15 (SEQ ID NOJ5). Figure 17 (SEQ ID 
NO:40). Figure 19 (SEQ ID NO:45), Figure 21 (SEQ ID NO:50). Figure 23 (SEQ ID NO:S6). Figure 25 (SEQ 
ID NO:61), Figure 27 (SEQ ID NO:66). Figure 29 (SEQ ID NO:71), Figure 31 (SEQ ID NO:79), Figure 33 
(SEQ ID NO:86). Figure 35 (SEQ ID NO:9I). Figure 37 (SEQ ID NO: 101). Figure 39 (SEQ ID NO:106). 
F.g«re4l (SEQ ID NO:ltl), Figure 43 (SEQ ID NO: 1 16). Figure 45 (SEQ ID NO: 123). Figure 47 (SEQ ID 
NO.133). Figure 49 (SEQ ID NO:139). Figure 51 (SEQ ID NO:I41). Figure 53 (SEQ ID NO:143). Figure 55 
(SEQ ID NO:145). Figure 57 (SEQ ID NO:l47). Figure 59 (SEQ ID NO:149). Figure 61 (SEQ ID NO:l51). 
Figure 63 (SEQ ID NO:156). Figure 65 (SEQ ID NO:158). Figure 67 (SEQ ID NO:160), Figure 69 (SEQ ID 
NO:162). Figure 71 (SEQ ID NO:167), Figure 73 (SEQ ID NO:169). Figure 75 (SEQ ID NO:177). Figure 77 
(SEQ ID NO:179). Figure 79 (SEQ ID NO:lg4). Figure 81 (SEQ ID NO:186). Figure 83 (SEQ ID NO:188). 
Figure 85 (SEQ ID NO190). Figure 87 (SEQ ID N0:I92). Rgure 89 (SEQ ID NO:228). Figure 91 (SEQ ID 
NO:230). Figure 93 (SEQ ID NO.232), Figure 95 (SEQ ID NO:240). Figure 97 (SEQ ID NO:248), Figure 99 
(SEQ ID NO:250). Figure 101 (SEQ ID NO:255), Figure 103 (SEQ ID NO^ST). Figure 105 (SEQ ID 
N0266). Figure 107 (SEQ ID NO:268). Figure 109 (SEQ ID NO270). Figure 1 1 1 (SEQ ID NO:272). Figure 
113 (SEQ ID NO:274), Figure 115 (SEQ ID NO-.276). Figure 117 (SEQ ID NO:278), Figure 119 (SEQ ID 
NO280), Figure 121 (SEQ ID NOi85), Figure 123 (SEQ ID NO:292), Figure 125 (SEQ ID NOJ94) or 
35 Figure 127 (SEQ ID NO:296. 



24. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 
sequence selected from the group consisting of the mil-length coding sequence of the nucleotide sequence 
shown in Figure 1 (SEQ ID Nat). Figure 3 (SEQ ID NO:ll), Figure 5 (SEQ ID NO:l3). Figure 7 (SEQ ID 
NO:18). Figure 9 (SEQ ID NO.20), Figure 1 1 (SEQ ID NQ.2S). Figure 13 (SEQ ID NO-JO). Figure 15 (SEQ 
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ID NO:35), Figure 17 (SEQ ID NO:40), Figure 19 (SEQ ID NO:45). Figure 21 (SEQ ID NO:50), Figure 23 
(SEQ ID Na.56), Figure 25 (SEQ ID N0:61), Figure 27 (SEQ ID NO;66), Figure 29 (SEQ ID N0:71), Figure 
31 (SEQ ID NO:79), Figure 33 (SEQ ID NO:86), Figure 35 (SEQ ID N0:91), Figure 37 (SEQ ID NO:101), 
Figure 39 (SEQ ID NO:106), Figure 41 (SEQ ID NO:ll 1), Figure 43 (SEQ ID NO:116), Figure 45 (SEQ ID 
5 NO:123), Figure 47 (SEQ ID NO:133), Figure 49 (SEQ ID NO:139), Figure 51 (SEQ ID NO:141), Figure 53 
(SEQ ID NO: 143), Figure 55 (SEQ ID NO:145), Figure 57 (SEQ ID N0:147), Figure 59 (SEQ ID NO:149), 
Figure 61 (SEQ ID NO:151), Figure 63 (SEQ ID NO:156), Figure 65 (SEQ ID NO:l58) f Figure 67 (SEQ ID 
NO:160), Figure 69 (SEQ ID NO:162), Figure 71 (SEQ ID NO:167X Figure 73 (SEQ ID NO:169), Figure 75 
(SEQ ID N0.177), Figure 77 (SEQ ID NO:179), Figure 79 (SEQ ID N0:184), Figure 81 (SEQ ID NO:186), 

10 Figure 83 (SEQ ID NO: 188), Figure 85 (SEQ ID NO: 190), Figure 87 (SEQ ID NO: 192), Figure 89 (SEQ ID 
NO:228), Figure 91 (SEQ ID NO:230), Figure 93 (SEQ ID NO:232), Figure 95 (SEQ ID NO:240). Figure 97 
(SEQ ID NO:248), Figure 99 (SEQ ID NO:250), Figure 101 (SEQ ID NO:255), Figure 103 (SEQ ID NO:257), 
Figure 105 (SEQ ID NO:266), Figure 107 (SEQ ID NO:268). Figure 109 (SEQ ID NO.270), Figure 1 1 1 (SEQ 
ID N0.272), Figure 1 13 (SEQ ID NO:274), Figure 1 15 (SEQ ID NO:276), Figure 1 17 (SEQ ID NO:278). 

15 Figure 1 19 (SEQ ID NO:280), Figure 121 (SEQ ID NO:285), Figure 123 (SEQ ID NO:292), Figure 125 (SEQ 
ID N0:294) or Figure 127 (SEQ ID NO:296. 

25. Isolated nucleic acid having at least 80% nucleic acid sequence identity to the roll-length 
coding sequence of the DNA deposited under ATCC accession number 209653, 209380, 209254, 209381, 
20 209376, 209260, 209374. 209265, 209419, 209424, 209388, 209433. 209704, 209720. 209621, 209620, 
209616, 209436. 209422, 209702, 209480, 209859, 209774, 209801, 209802, 209849, 209905. 209950, 
209962. 209866, 203128, 209986, 203173, 203132, 203093, 203457, 203244. 203094, 203282. 203276, 

203160, 203277, 203573. 203553, , 209456, 209397, 209492. , 209808. 209802. 209750, 209858, 

203583. 209989, 203092. 203 131, 203269. 203267, 203657. 203818, 203834, 23-PTA. 203906. 



25 



30 



26. A vector comprising the nucleic acid of any one of Claims 22 to 25. 

27. The vector of Claim 26 operably linked to control sequences recognized by a host cell 
transformed with the vector. 



28. A host cell comprising the vector of Claim 26. 

29. The host cell of Claim 28. wherein said cell is a CHO cell 
35 30. The host cell of Claim 28, wherein said cell is an £ colL 

31. The host cell of Claim 28, wherein said ceil is a yeast cell. 

32. A process for producing a PRO200, PRO204, PR0212, PR02I6, PR0226, PRO240, 
40 PR0235, PR0245, PR0172, PR0273, PR0272, PR0332, PR0526, PRO701, PR036I, PR0362, PR0363, 
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PR0364, PR0356, PR0531, PR0533. PROJ083, PR0865, PRO770, PR0769, PR0788. PROU 14, PRO1007, 
PR01184, PRO1031, PR01346, PRD1I55, PRO1250, PR01312, PROII92, PRO 1246, PR01283, PR01195, 
PR01343, PROW18. PROI387, PRO1410, PROI9I7, PR01868, PRO205, FR021, PR0269, PR0344, 
PR0333, PR0381, PRO720, PR0866, PRO840, PR0982, PR0836, PR01159, PROI358, PR01325, 
PROI338, PR01434, PR04333, PRO4302. PRO4430 or PR05727 polypeptide comprising coinmng the host 
cell of Claim 28 under conditions suitable for expression of said polypeptide and recovering said polypeptide 
from the cell culture. 

33. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid 
sequence selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID N02), 
Figure 4 (SEQ ID NO: 1 2), Figure 6 (SEQ ID NO: 14), Figure 8 (SEQ ID NO: 19), Figure 10 (SEQ ID NO.21). 
Figure 12 (SEQ ID NO:26), Figure 14 (SEQ ID NO:3i), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID 
NO:41), Figure 20 (SEQ ID NO:46). Figure 22 (SEQ ID NO:51), Figure 24 (SEQ ID NO:57), Figure 26 (SEQ 
ID NO:62), Figure 28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80), Figure 34 
(SEQ ID NO:87). Figure 36 (SEQ ID NO:92). Figure 38 (SEQ ID NO: 1 02). Figure 40 (SEQ ID NO: 107), 
Figure 42 (SEQ ID NO:l 12). Figure 44 (SEQ ID NO:117), Figure 46 (SEQ ID NO:124), Figure 48 (SEQ ID 
NO: 134). Figure 50 (SEQ ID NO: 140), Figure 52 fSEQ ID NO: 1 42), Figure 54 (SEQ ID NO: 144), Figure 56 
(SEQ ID N0.146). Figure 58 (SEQ ID NO:148), Figure 60 (SEQ ID NO:150), Figure 62 (SEQ ID NO:152), 
Figure 64 (SEQ ID N0.157). Figure 66 (SEQ ID NO:159) ( Figure 68 (SEQ ID NO:161), Figure 70 (SEQ ID 
NO:163), Figure 72 (SEQ ID NO:I6B), Figure 74 (SEQ ID NO:I70), Figure 76 (SEQ ID NO: 178), Figure 78 
(SEQ ID NO:I80), Figure 80 (SEQ ID NO:185), Figure 82 (SEQ ID NO:187). Figure 84 (SEQ ID NO:189), 
Figure 86 (SEQ ID NO:19I), Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID 
NO.231), Figure 94 (SEQ ID NOJ33), Figure 96 (SEQ ID NO:24l). Figure 98 (SEQ ID NO:249). Figure 100 
(SEQ ID NO:25I). Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID 
NO:267). Figure 108 (SEQ ID NO:269), Figure 1 10 (SEQ ID NO:271). Figure 1 12 (SEQ ID NO:273), Figure 
114 (SEQ ID NO:275), Figure 116 (SEQ ID NO:277), Figure 118 (SEQ ID NO:279), Figure 120 (SEQ ID 
NO:28l). Figure 122 (SEQ ID NO:286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or 
Figure 128 (SEQIDNO:297). 

34. An isolated polypeptide scoring at least 80% positives when compared to an amino acid 
sequence selected from the group consisting of the amino acid se quence shown in Figure 2 (SEQ ID NOJK 
Figure 4 (SEQ ID NO:l2), Figure 6 (SEQ ID NO:l4), Figure 8 (SEQ ID NO:19), Figure 10 (SEQ ID NO:21), 
Figure 12 (SEQ ID NO:26), Figure 14 (SEQ ID NO:31), Figure 16 (SEQ ID NCfc36), Figure 18 (SEQ ID 
NO:4I), Figure 20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:51). Figure 24 (SEQ ID NO:57), Figure 26 (SEQ 
ID NO:62), Figure 28 (SEQ E> NO:67), Figure 30 (SEQ ID NO:72) t Figure 32 (SEQ ID NO:80), Figure 34 
(SEQ ID NO:87), Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO:102), Figure 40 (SEQ ID NO:I07), 
Figure 42 (SEQ ID NO:112X Figure 44 (SEQ ID NO:117), Figure 46 (SEQ ID NO.124), Figure 48 (SEQ ID 
NO:134), Figure 50 (SEQ ID NO: 1 40), Figure 52 (SEQ ID NO;142K Figure 54 (SEQ ID NO:I44), Figure 56 
(SEQ ID Nai46), Figure 58 (SEQ ID NO:148), Figure 60 (SEQ ID NCU50), Figure 62 (SEQ ID NO:152), 
Figure 64 (SEQ ID NChl57), Figure 66 (SEQ ID NO:159), Figure 68 (SEQ ID NO:161), Figure 70 (SEQ fl> 
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NO: 163), Figure 72 (SEQ ID NO: 168), Figure 74 (SEQ ID NO: 170), Figure 76 (SEQ ID NO: 178), Figure 78 
(SEQ ID NO:180), Figure 80 (SEQ ID NO.185), Figure 82 (SEQ ID NO:187), Figure 84 (SEQ ID NO:189), 
Ftgure 86 (SEQ ID NO:19I) f Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID 
NO:231), Figure 94 (SEQ ID N0033), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID NO:249), Figure 100 
5 (SEQ ID NO:25I), Figure 102 (SEQ ED N0256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID 
NO:267), Figure 108 (SEQ ID NO:269), Figure 110 (SEQ ID NO:271), Figure 112 (SEQ ID NO:273), Figure 
114 (SEQ ID NO:275), Figure 1 16 (SEQ ID NO:277), Figure 118 (SEQ ID NO:279), Figure 120 (SEQ ID 
NO-.281), Figure 122 (SEQ ID NO:286). Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or 
Figure 128 (SEQ ID NO:297). 

10 

35. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid 
sequence encoded by the full-length coding sequence of the DNA deposited under ATCC accession number 
209653, 209380, 209254, 209381, 209376, 209260, 209374. 209265, 209419, 209424, 209388, 209433, 
209704, 209720. 209621. 209620, 209616, 209436. 209422. 209702. 209480, 209859. 209774. 209801, 
15 209802. 209849. 209905. 209950, 209962. 209866. 203128, 209986. 203173. 203132. 203093. 203457, 

203244. 203094. 203282. 203276, 203160. 203277. 203573. 203553. . 209456, 209397. 209492. , 

209808. 209802. 209750, 209858, 203583. 209989. 203092, 203131, 203269. 203267, 203657, 203818, 
203834, 23-PTA. 203906. 

20 36. A chimeric molecule comprising a polypeptide according to any one of Claims 33 to 35 

fused to a heterologous amino acid sequence. 

37. The chimeric molecule of Claim 36, wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

25 

38. The chimeric molecule of Claim 36. wherein said heterologous amino acid sequence is a Fc 
region of an immunoglobulin. 

39. An antibody which specifically binds to a polypeptide according to any one of Claims 33 to 

30 35. 

40. The antibody of Claim 39, wherein said antibody is a monoclonal antibody, a humanized 
antibody or a single-chain antibody. 

35 41. Isolated nucleic acid having at least 80% nucleic acid sequence identity to: 

(a) a nucleotide sequence encoding the polypeptide shown in Figure 2 (SEQ ID N02), Figure 4 
(SEQ ID NO:12) t Figure 6 (SEQ ID NO:14) t Figure 8 (SEQ ID NO:19), Figure 10 (SEQ ID NO:21), Figure 12 
(SEQ ID N026), figure 14 (SEQ ID NO.31), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:4 1), Figure 
20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:51). Figure 24 (SEQ ID NO:57), Figure 26 (SEQ ID NO:62), 

40 Figure 28 (SEQ ID NO:67) f Figure 30 (SEQ ID N0:72), Figure 32 (SEQ ID NO:80), Figure 34 (SEQ ID 
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NO:87), figure 36 (SEQ ID NO*2), Figure 38 (SEQ ID NO:102). Figure 40 (SEQ ID NO:I07), Figure 42 
(SEQ ID N0:1 12), figure 44 (SEQ ID N0:1 17), figure 46 (SEQ ID NO:124). figure 48 (SEQ ID NO:t34), 
figure 50 (SEQ ID NO:140), Figure 52 (SEQ ID NO:142), Figure 54 (SEQ ID NO:144), figure 56 (SEQ ID 
NO: 146), Figure 58 (SEQ ID NO:148), figure 60 (SEQ ID NOrl50), figure 62 (SEQ ID NO:I52), figure 64 
5 (SEQ ID NO:157), Figure 66 (SEQ ID NO:159), figure 68 (SEQ ID N0:I61), figure 70 (SEQ ID NOM63), 
Figure 72 (SEQ ID NO: 168), Figure 74 (SEQ ID NOI70), Figure 76 (SEQ ID NO:178), Figure 78 (SEQ ID 
NO180). Figure 80 (SEQ ID NO:185), figure 82 (SEQ ID NO:187), figure 84 (SEQ ID NO:189), figure 86 
(SEQ ID NO: 19 1). Figure 68 (SEQ ID NO: 193). Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID NO:23l), 
Figure 94 (SEQ ID NO:233), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID NO:249), Figure 100 (SEQ ID 
10 NO.251 ), Figure 102 (SEQ ID NO:256), figure 104 (SEQ ID NO:258). figure 1 06 (SEQ ID NO:267), Figure 
108 (SEQ ID NOS69), Figure 110 (SEQ ID N0271), Figure 112 (SEQ ID N0273), Figure 114 (SEQ ID 
N0.275X Figure 1 16 (SEQ ID NO:277), figure 1 18 (SEQ ID N0279), figure 120 (SEQ ID NO-.281), Figure 
122 (SEQ ID NO:286). Figure 124 (SEQ ID N0293), Figure 126 (SEQ ID NO:295) or Figure 128 (SEQ ID 
NO:297) lacking its associated signal peptide; 
15 < b > a nucleotide sequence encoding an extracclhilar domain of the polypeptide shown in Figure 

2 (SEQ ID NO:2), Figure 4 (SEQ ID N012). Figure 6 (SEQ ID NO:14). Figure 8 (SEQ ED NO:19), Figure 10 
(SEQ ID NO:21), Figure 12 (SEQ ID NO:26), Figure 14 (SEQ ID NO:31). Figure 16 (SEQ ID NO:36), Figure 
18 (SEQ ID NO:41), Figure 20 (SEQ ID NO:46), Figure 22 (SEQ ID NO:5l). Figure 24 (SEQ ID N057), 
Figure 26 (SEQ ID NO.62), Figure 28 (SEQ ID NO:67), Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID 
20 NO:80), Figure 34 (SEQ ID NO:87), Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO102), figure 40 
(SEQ ID NO:107), Figure 42 (SEQ ID N0:1I2), Figure 44 (SEQ ID N0:1I7), Figure 46 (SEQ ID N0:124), 
figure 48 (SEQ ID NO:134). Figure 50 (SEQ ID NO:I40), figure 52 (SEQ ID NO:142), Figure 54 (SEQ ID 
NO:144). Figure 56 (SEQ ID NO:146), Figure 58 (SEQ ID NO:148), figure 60 (SEQ ID NO:150), Figure 62 
(SEQ ID NO:152), figure 64 (SEQ ID NO:157), Figure 66 (SEQ ID NO:159). figure 68 (SEQ ID N0:I61), 
25 Figure 70 (SEQ ID NO: 163), Figure 72 (SEQ ID NO: 1 68), Figure 74 (SEQ ID NO: 170), Figure 76 (SEQ ID 
NO:178). Figure 78 (SEQ ID NO:180), Figure 80 (SEQ ID NO:185), Figure 82 (SEQ ID NOM87), figure 84 
(SEQ ID NO:I89). Figure 86 (SEQ ID NO.191), Figure 88 (SEQ ID N0:I93), Figure 90 (SEQ ID NO:229), 
Figure 92 (SEQ ID N0031), Figure 94 (SEQ ID NO.233), Figure 96 (SEQ ID NO:241). Figure 98 (SEQ ID 
NOS49), figure 100 (SEQ ID NO.251), Figure 102 (SEQ ID NO;256), Figure 104 (SEQ ID NO:258). Figure 
30 106 (SEQ ID NO:267), Figure 108 (SEQ ID NO:269), Figure 110 (SEQ ID NO:271) r Figure 112 (SEQ ID 
NO.273), Figure 1 14 (SEQ ID NO:275), figure 1 16 (SEQ ID NO-.277), Figure 1 18 (SEQ ID NOi279), Figure 
120 (SEQ ID NOS81), figure 122 (SEQ ID NO.286), Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID 
NO:295) or Figure 128 (SEQ ID N0.297) with its associated signal peptide; or 

(c) a nudeoude sequence encoding an extracellular domain of the polypeptide shown in Figure 2 
35 (SEQ ID NO.2), Figure 4 (SEQ ID NO:l2), Figure 6 (SEQ ID NO:14), Figure 8 (SEQ ID NO:19), Figure 10 
(SEQ ID NO:21), Figure 12 (SEQ ID NO.26), Figure 14 (SEQ ID NO:3 1), figure 16 (SEQ ID N036), figure 
18 (SEQ ID NO:41) f figure 20 (SEQ ID NO:46), figure 22 (SEQ ID NOSI), Figure 24 (SEQ ID NO:57), 
figure 26 (SEQ ID NO:62), figure 28 (SEQ ID NO:67), figure 30 (SEQ ID NO:72), figure 32 (SEQ ID 
NOS0), Figure 34 (SEQ ID NO:87), figure 36 (SEQ ID NOS2), Figure 38 (SEQ ID NO: 102), Figure 40 
40 (SEQ ID NO.107). figure 42 (SEQ ID NO:l 12), Figure 44 (SEQ ID NO: II 7), Figure 46 (SEQ ID NO:124). 

171 

SUBSTITUTE SHEET (RULE 26) 

(C) 2003 Copyright Derwenl Information Ltd. 



WO 00/53758 FCT/US00/Q5841 

Figure 48 (SEQ ID NO: 134), Figure 50 (SEQ ID NO: 140), Figure 52 (SEQ ID NO: 142). Figure 54 (SEQ ID 
NO:144) t Figure 56 (SEQ ID NO:146), Figure 58 (SEQ ID NO:148), Figure 60 (SEQ ID NO:150), Figure 62 
(SEQ ID NO.152), Figure 64 (SEQ ID N0:157), Figure 66 (SEQ ID NO.159), Figure 68 (SEQ ID NO:161), 
Figure 70 (SEQ ID NO:163), Figure 72 (SEQ ID NO:168), Figure 74 (SEQ ID NO:170), Figure 76 (SEQ ID 
5 NO:178), Figure 78 (SEQ ID NO:180), Figure 80 (SEQ ID NO:185), Figure 82 (SEQ ID NO:187), Figure 84 
(SEQ ID NO-.189), Figure 86 (SEQ ED NO:191), Figure 88 (SEQ ID NO:193), Figure 90 (SEQ ID NO:229), 
Figure 92 (SEQ ID NO:231), Figure 94 (SEQ ID N0233), Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID 
NO:249), Figure 100 (SEQ ID NO:25I), Figure 102 (SEQ ID NO:256), Figure 104 (SEQ ID NO:258), Figure 
106 (SEQ ID NO:267), Figure 108 (SEQ ID NO:269), Figure 110 (SEQ ID NO:27iX Figure 112 (SEQ ID 
10 NCfc273), Figure 1 14 (SEQ ID NO:275), Figure 1 16 (SEQ ID NO:277) t Figure 1 18 (SEQ ID NO:279). Figure 
120 (SEQ ID NO.281), Figure 122 (SEQ ID NO:286) f Figure 124 (SEQ ID NO.293), Figure 126 (SEQ ID 
NO:295) or Figure 128 (SEQ ID NO:297) lacking its associated signal peptide. 

f -i 

42. An isolated polypeptide having at least 80% amino acid sequence identity to: 

15 (a) the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO: 12), Figure 6 

(SEQ ID N0:I4), Figure 8 (SEQ ID NO:19), Figure 10 (SEQ ID NO:21), Figure 12 (SEQ ID NO:26), Figure 
14 (SEQ ID NO:3I), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:4l), Figure 20 (SEQ ID NO:46), 
Figure 22 (SEQ ID N03I), Figure 24 (SEQ ID NO:57), Figure 26 (SEQ ID NO:62), Figure 28 (SEQ ID 
NO:67). Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80), Figure 34 (SEQ ID NO:87), Figure 36 (SEQ 

20 ID NO:92), Figure 38 (SEQ ID NO:102), Figure 40 (SEQ ID NO:107), Figure 42 (SEQ ID NO: 1 12), Figure 44 
(SEQ ID NO: 1 17), Figure 46 (SEQ ID NO: 124), Figure 48 (SEQ ID NO: 134), Figure 50 (SEQ ID NO: 140), 
Figure 52 (SEQ ID NO:142), Figure 54 (SEQ ID NO:144), Figure 56 (SEQ ID NO:146). Figure 58 (SEQ ID 
NO:148), Figure 60 (SEQ ID NO:I50), Figure 62 (SEQ ID NO:152), Figure 64 (SEQ ID NO:157). Figure 66 
(SEQ ID NO:159), Figure 68 (SEQ ID NO:16I), Figure 70 (SEQ ID NO:163), Figure 72 (SEQ ID NCU68), 

25 Figure 74 (SEQ ID NO: 170), Figure 76 (SEQ ID NO: 178), Figure 78 (SEQ ID NO: 180), Figure 80 (SEQ ID 
NO;185), Figure 82 (SEQ ID NO:l87), Figure 84 (SEQ ID NO:189), Figure 86 (SEQ ID NO:I91), Figure 88 
(SEQ ID NO:193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID NO:231), Figure 94 (SEQ ID NO:233), 
Figure 96 (SEQ ID NO:241), Figure 98 (SEQ ID NO:249), Figure 100 (SEQ ID NO:251 ), Figure 102 (SEQ ID 
NO:256), Figure 104 (SEQ ID NO:258), Figure 106 (SEQ ID NO:267), Figure 108 (SEQ ID NO:269), Figure 

30 1 10 (SEQ ID NO:271), Figure 1 12 (SEQ ID NO:273), Figure 1 14 (SEQ ID NO:275), Figure 1 16 (SEQ ID 
NO-.277), Figure 118 (SEQ ID NO:279), Figure 120 (SEQ ID N0281), Figure 122 (SEQ ID NO:286), Figure 
124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or Figure 128 (SEQ ID NO:297), lacking its associated 
signal peptide; 

(b) an extracellular domain of the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ 
35 ID NO:12), Figure 6 (SEQ ID NO: 14), Figure 8 (SEQ ID NO:I9), Figure 10 (SEQ ID NO:21), Figure 12 (SEQ 
ID NO:26), Figure 14 (SEQ ID NOJ1), Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:41), Figure 20 
(SEQ ID NO:46)» Figure 22 (SEQ ID NO:5 1). Figure 24 (SEQ ID NO:57), Figure 26 (SEQ ID NO:62), Figure 
28 (SEQ ID NOr67), Figure 30 (SEQ ID NO;72), Figure 32 (SEQ ID NO:80), Figure 34 (SEQ ID NO:87), 
Figure 36 (SEQ ID NOS2), Figure 38 (SEQ ID NO:102), Figure 40 (SEQ ID NO:107), Figure 42 (SEQ ID 
40 NO:1 12), Figure 44 (SEQ ID NO:l 17), Figure 46 (SEQ ID NO:I24), Figure 48 (SEQ ID NO:134), Figure 50 
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(SEQ O NO: 140), Figure 52 (SEQ ID NO: 142), Rgnre 54 (SEQ ID NO: 144), Figwe 56 (SEQ ID NO: 146), 
Figure 58 (SEQ ID NO:148). Figure 60 (SEQ ID NO:150), Figure 62 (SEQ ID NO: 152), Figure 64 (SEQ ID 
NO:!57). Figure 66 (SEQ ID NO: 159), Figure 68 (SEQ ID NO:l6t), Figure 70 (SEQ ID NO:163), Figure 72 
(SEQ ID NO-.168), Figure 74 (SEQ ID NO:170), Figure 76 (SEQ ID NO:178), Figure 78 (SEQ ID NO:180), 
Figure 80 (SEQ ID NO: 185), Figure 82 (SEQ ED NO: 187), Figure 84 (SEQ ID NO: 189), Figure 86 (SEQ ID 
NO: 191), Figure 88 (SEQ ID NO: 193), Figure 90 (SEQ ID NO:229), Figure 92 (SEQ ID N0:23l), Figure 94 
(SEQ ID NO:233). Figure 96 (SEQ ID NO:24l), Figure 98 (SEQ ID NO-.249), Figure 100 (SEQ ED N0251), 
Figure 102 (SEQ ID NO:256). Figure 104 (SEQ ED N025S), Figure 106 (SEQ ID NO:267), Figure 108 (SEQ 
ID NO-.269), Figure 110 (SEQ ED NO:271), Figure 112 (SEQ ID NO:273), Figure 114 (SEQ ID NO:275), 
Figure 1 16 (SEQ ID N0277). Figure 1 18 (SEQ ED N0279), Figure 120 (SEQ ID NO:281). Figure 122 (SEQ 
ID NO:286). Figure 124 (SEQ ID NO:293), Figure 126 (SEQ ID NO:295) or Figure 128 (SEQ ID NO-.297), 
with its associated signal peptide; or 

(e) an extracellular domain of the polypeptide shown in Figure 2 (SEQ ID NO:2). Figure 4 (SEQ 
ID NO:12). Figure 6 (SEQ ID NO:14), Figure 8 (SEQ ID NO:19), Figure 10 (SEQ ID NO:21). Figure 12 (SEQ 
ID NO:26), Figure 14 (SEQ ID NO:31). Figure 16 (SEQ ID NO:36), Figure 18 (SEQ ID NO:41). Figure 20 
(SEQ ID NO:46). Figure 22 (SEQ ID NO:5l). Figure 24 (SEQ ID NO:57). Figure 26 (SEQ ID NO:62), Figure 
28 (SEQ ID NO:67). Figure 30 (SEQ ID NO:72), Figure 32 (SEQ ID NO:80). Figure 34 (SEQ ID NO:87), 
Figure 36 (SEQ ID NO:92), Figure 38 (SEQ ID NO:102), Figure 40 (SEQ ID NO:107), Figure 42 (SEQ ID 
NO:112), Figure 44 (SEQ ID NO:117), Figure 46 (SEQ ID NO:124). Figure 48 (SEQ ID NO:l34), Figure 50 
(SEQ ID NO: 140), Figure 52 (SEQ ID NO: 142). Figure 54 (SEQ ID NO: 144). Figure 56 (SEQ ID NO:146), 
Figure 58 (SEQ ID NO: 148). Figure 60 (SEQ ID NO: 150), Figure 62 (SEQ ID NO: 152), Figure 64 (SEQ ID 
NO:157), Figure 66 (SEQ ID NO:I59). Figure 68 (SEQ ID NO:161), Figure 70 (SEQ ID NO:163). Figure 72 
(SEQ ID NO:168), Figure 74 (SEQ ID NO:170), Figure 76 (SEQ ID NO:178), Figure 78 (SEQ ID NO.180). 
Figure 80 (SEQ ID NO: 185), Figure 82 (SEQ ID NO: 1 87), Figure 84 (SEQ ID NO: 189). Figure 86 (SEQ ID 
NO:19I), Figure 88 (SEQ ID NO: 193). Figure 90 (SEQ ID N0029). Figure 92 (SEQ ID NO:231), Figure 94 
(SEQ ID NO:233), Figure 96 (SEQ ID NO:241). Figure 98 (SEQ ID NO:249), Figure 100 (SEQ ID NO:251), 
Figure 102 (SEQ ID NO.256), Figure 104 (SEQ ID NO-.258), Figure 106 (SEQ ID NOJ67). Figure 108 (SEQ 
ID N0269), Figure 110 (SEQ ID NO:27I). Figure 112 (SEQ ID NO:273). Figure 114 (SEQ ID NO-.275), 
Figure 1 16 (SEQ ID NO:277). Figure 1 18 (SEQ ID NO:279). Figure 120 (SEQ ID NO:28I). Figure 122 (SEQ 
ID N0286). Figure 124 (SEQ ID N0293). Figure 126 (SEQ ID NO:295) or Figure 128 (SEQ ID NO:297). 
lacking its associated signal peptide. 

43. A method of affecting the proliferation ofT-cells comprising contacting PBMC cells with an 
effective amount of a PRO200. PRO204, PR02I2. PR0216, PR0226. PRO240. PR0235. PR0245. PR0172. 
PR0273. PR0272, PR0332, PR0526. PRO701, PR0361. PR0362. PR0363. PR0364. PR0356, PR0531, 
PR0533. PRO1083, PR0865. PRO770. PR0769, PR0788. PR01I14, PROI007. PROl 184, PRO1031, 
PR01346. PROl 155, PROI250, PR01312, PROl 192, PR01246. PR012M, PROl 195, PR01343, PR01418, 
PR01387. PROI410, PR01917, PR01868. PRO205, PR021, PR0269. PR0344. PR0333. PR0381, PRO720, 
PR0866, PRO840. PR0982. PR0836, PROl 159, PR0I358, PR01325, PR01338, PROM34, PR04333, 
PRO4302. PRO4430 or PR05727 polypeptide and measuring the change in proliferation from control levels. 
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44. A TPf^h"^ of affecting vascular permeability comprising injecting a test animal with an 
effective amount of a PRO200, PRO204, PR0212, PR02I6, PR0226, PRO240, PR0235, FR0245, PR0172, 
PR0273, PR0272, PR0332, PR0526, PRO701, PR036I, PR0362, PR0363, PR0364, PR0356, PR0531, 
5 PR0533. PRO1083. PR0865, PRO770, PR0769, PR0788, PROH14, PRO1007, PROH84. PRO1031, 
PR01346, PR01155, PRO1250, PR01312, PROU92, PR01246, PROU83, PR01195, PR01343, PR01418, 
PR01387, PRO1410, PR01917, PR01868, PRO205, PR021, PR0269, PR0344. PR0333, PR0381, PRO720, 
PRO866. PRO840, PR0982, PR0836, PROH59, PR01358, PR01325. PR01338, PR01434. PR04333, 
PRO4302, PRO4430 or PROS727 polypeptide, and measuring the resulting extent of vascular permeability. 
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Figure I 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTC 

CAGGTTTTGCTTTGATCCnTTTCAAAAACTC 

TTTTGGATGGGATTATGTGGAAACTACCCTGC^ 

GCTTCCACCCCA6TGCAGC CTTCCCCTGGCGGTGGTGAAAGAGACTCGGGAGTCGCTGCT 
TCCAAAGTGCCCGCCGTGAGTGAGCTOTCACCCCAGTCAGCCAA 

ATGAGCCTCTTCGGGCTTCTCCTGCTGACATCTGCCCTGGCCGGCCAGAGACAGGGGACT 
CAGGCGGAATCCAACCTGAGTAGTAAATTCCAGTTTTCC^^ 

GTACAAGATCCTCAGCATGAGAGAATTATTACTGTGTCTACTAATGGAAGTATTCA 
CCAAGGTTTCCTCATACTTATCCAAGAAATACGGTCTTG 

GAGGAAAATGTATGGATACAACTTACG TTTGATGAAAGATTTGGGCTTGAAGACCCAGAA 
GATGACATATGCAAGTATGATTTTGTAGAAGTTGAGGAACCCACT 
GGGCGCTGGTGTGGTTCTGGTACTGTACCAGGAAAACAGATTTCTAAA 
AGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCA^ 

AACATTGTCATGCCACAATTCACAGAAG CTG TGAGTCCTTCAG TGCTACCCCCTTCAGCT 
TTGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGT^ 

CGATATCTTGAACCAGAGAGATGG CAGTTGGACTTAGAAGATCTATATAGGCCAACTTGG 

CAACTTCTTGGCAAGGCTTTTGTTTTTGGAAGAAAATCCAGAGTGGTGG^ 

CTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGG 

GAAGAACTAAAGAGAACCGATACCATTTTCTGGCC^GGTTGTCTCCTGGTTA^ 

GGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCA^ 

GTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC^ 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTCT 

GGGAGCACAGGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGC 

AGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTO 

TTCAAGGACC TT TCATCTTCAGGATTTACAGTGCATT 

AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCT^ 

T CAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCAC CAG CTAGTTT CAGAGTT 
ACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTT 

CGATACGG CTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCAC CTGATTC 
CGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTC 
TTTTTTTTTTTTTTTTTG CTCATATTCACATATGTAAACCAGAACATTCTATGTACTACA 
AACCTGGTTTTTAAAAAGG^CTATGTTGCTATGAATTAAA 

ACAGACTGGA1T1TTCATAU , 1 U 1 , C1 , 1ATTAAAA , X^TCTGCCATTTAGAAGAAGAGAACTAC 
ATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTT 
TAAGTCAGTTTATTTGTTTCATTGTG TA CATTTTTATATTCTC CTTTTGACATTATAACT 
GTTGGCTCTTCTAATCTTGTTAAATATAT 

TTTATGACAACTTAGATCAACTATTTTTAG CTTGGTAAATTTTO 

TAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTC 
ATTCTCGTATGGTGCTAGAGTTAGATTAATCTG C^TTTTAAAAAACTGAATTGGAATAGA 
ATTGGTAAGTTGCAAAGACri"ll JUAAAATAATTAAATTATCATATCTTCCATTCCTGTT 
ATTGGAGATGAAAATAAAAAG CAACTTATGAAAGTAGACATTCAGATCCAG CCATTACTA 
ACCTATTCCTTTTT IXj GGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAA 
AAGACTTGGCAGCrrCCTGATAAAGCGTGCTGTGCrGTGCAGTAGGAACACATC 
ATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTC 
GATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTGGCAACT 
TAAAAGAAAATCAG TAAAATA1 T~i~j. G u\rr Lf I AAAATGCTT AATATNGTG C CTAG U tiaTG 
TGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGAATGTGG CTATTTTG 
GGGAGAAAATTAAAAAAAAAAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 
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Figure 2 

MSLFGLlXLTSAI^GQRQCrrQAESNL^SKPQFSSNKEQNGVQDPQHERIITVSTNGSIHS 
PRFPHTYPRNTVLVWRLVAVEEHVntflQ^ 

GRWCGSGTVPGKQISKGNQIRIRFVSDEYFPSEPGH3HYNIVMPQFTEAVSre 
LPUDLLNNAITAFSTI^URYI^ERWQI^^ 

LTEEVRLYSCTPRNFSVSIREELKRTOTItWPGCLLVKRCGGhKTACCLHNCNECQCVPSK 
VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 
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Figure 3 

TGCCGGGCTGCGGGGCGCCTTGACTCTCCCTCCACCCT 
TGCCCCTGGACTCCCGTCTCCTCCTGTCCTCC^ 

GCAGCTTCCCGCGTCTCCGGCGCAGCTTCTCAGCGGACGACCCTCTCGCTCCGGGGC^ 

GCC&GTCCCTGGATGTTGCTGAAACTCTa^ 

CCCGCCGGGTGCC^CTGCCACCGCCGCCGCCTCTC 

TAGCCCGCTGCCCGGCCCCCGCGATCCTGTGTTCCTCGGAAGCCGTTTC 

TTGCACGAACTAGTC 

ATOGTGCTGTGGGAGTCCCCGCGGC^^ 

TGGCTGCTGCTGCTGCCCGTCATGCTACTCATCGTAGCCCGC 
TTCCCTACCTCCTTAAGTGACTGCCAAACGCCCACCGGCTGGAATTGCTCTGGTT^ 
GACAGAGAAAATGATCTCTTCCTCTGTGACACCAACACCTGTAAATTTGA 
TTAAGAATTGGAGACACTGTGACTTGCGTCTGTCAGTTCAAGTC 

TGCAAACAGCAGAGTGAGATACTTGTGGTCTC^GAAGGATCATGTGCCACAGATO 

TCAGGATCTGGAGATGGAGTCCATGAAGGCTCTGGAGAAACTAGTCAAAAGGAGACATrc 

ACCTGTGATATTTGCCAGTTTGGTGCAGAATGTGACGAAGATGCCGAGGATCT 

GTGTGTAATATTGACTGTTCTCAAACCAACTTCAATCCCCTCTGCGCTTCTGATGG^^ 

TCTTATGATAATGCATGCCAAATCAAAGAAGCATCGTGTCAGAAACAGGAGAAAATTGA^ 

GTCATGTCTTTGGGTCGATGTCAAGATAACACAACTAC^ 

CATTATGCAAGAACAGATTATGCAGAGAATGCTAACAAATTAGAAGAAAGTGCCAGA^ 

CACCACATACCTTGTCCGGAACATTACAATGGCTTCTGCA 

TCTATGAATATGGAGGAGCCATCTTGCAGGTGTGATC 

TTAATCGCAGCTGTGATTGGAACAATTCAGATTGCTGTCATCTGTG 

TCCTCTGCATCACAAGGAAATGCCCCAGAAGCAAC^^ 

CAGGGCACTACAGTTCAGLACAATACAACAAGAGCGTCCACGAGGTTJ^ 

ATGTTTCACAGTGGCTGGACTACCGAGAGCTTGGACTACACAATACAGTATT^ 

AAGAATAAGACAAGAGATCTACACATGTTGCCTTC 

AAACATGTACTACAGCTATATTTGATTATGTATGGATATATTTGAAATAGTATA 
CnTGATG7*rriTl'CTGTAATGTAAATAAACTATTTATATCACACAATATAG riTTTT L TT 
TCCCATGTATTTGTTATATATAATAAATACTCAGTGATGAG 
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Figure 4 

MVLVreSPRQCSSWTLCEGFCWLLIJj^ 

DDRENDLFLCiriOTCKFDGECIJlIGDTVTCVCQFKCNNDYW 

AACRQQSEILWSEGSCATDAGSGSGDGVHEGSGETSQKETSTCDICQFGAECDEDABD 
WCVOTIDCSQTNFNPLCASDGKS YDNACQI KEAS CQKQEKI EVMSLGRCQDNTTTTTK 
S EDGHYARTDYAENANKLEESAREHHI PCPEHYNGFCMHGKCEHS INMQEPSCRCDAGY 
TGQHCEKXDYS VLYVVPGPVRFQ YVLI AAVT GTI Q I AVI CVWLCITRKCPRSNRIHRQ 
KQNTGHYS SDNTTRASTRLI 
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Figure 5 

TCCGCAGGCGGACCGGMGCA^ 

GTCCGCGCTGAGCCGCGCTCTCCCTGCTCCAGCAAGGACC 

ATQAGGGCGCTGGAGGGGCCAGGCCTGTCGCTGCT^ 

CTXSCTGCCGGTGCCGGCTGTACGCGGAGTGGCAG^ 

GCAGAGACAGGGGAGCGGCTGGTGTGCGCCCAGTGCCCCCCAGGCACCTTTG^ 

CCX5TGCCGCCGAGACAGCCCCACGACGTGTGGCCCGTGTCCACCGCGCCACT 

TTCTGGAACTACCTGGAGCGCTGCCGCTACTGCAACGTCCT 

GAGGCACGGGCTTGCCAC^CCACCCACAACCGTGCCTGCCGCTGCCGCA 

GCGCACGCTGGTTTCTGCTTGGAGCACGCATCGTGTCCACCTGGTC 

CCGGGCACCCCCAGCCAGAACACGCAGTGCCAGCCGTGCCCCCCAGGCACCTTCTCAGCC 

AGCAGCTCCAGCTCAGAGCAG7GCCAGCCC 

CTCAATGTGCCAGGCTCTTCCTCCCATGACACCC^ 

CTCAGCACCAGGGTACCAGGAGCTGAGGAGTGTGAGCGTGCCGTCATCGACTTT^ 

TTCCAGGACATCTCCATCAAGAGGCTGCAGCGGCTGCTGCAGGCCCTCGAGGCC^ 

GGCTGGGGTCCGACACCAAGGGCGGGCCGCGCGGCCTTGCAGCTGAAGCTGCGTCGGCGG 

CTCACGGAGCTCCTGGGGGCGCAGGACGGGGCGCTGCTCGTGCGGCTGCT 

CGCGTGGCCAGGATGCCCGGGCTGGAGCGGAGCGTCCGTGAGCGCTTCCTCCCTGTG 

T^TCCTGGCCCCCTCTTATTTATTCTACATCCITGGCACCCCACTTGCACT 

uriirrrrr AAATAGAAGAAATGAGG lTrCTl ' A AAAAAAAAAAAAAAAAAAAAA 
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Figure 6 

MRALEGPGLSIJjCLVIALPALLPV^ 

RPCRRDSPTTCGPCPPRHYTQPW^ERCRYam-CGEKEEEARACHATHNRACRCRTG 

FFAHAGFCI^HASCPPGAGVIAPGTPSQNTQCQPCPPGTFSASSSSSEQCQPHRNCTAI/ 

GLALNVPGSS SHDTLCTSCTGFPLSTRVPGAEECERAVT DFVAFQD I S I KRLQRLLQAL 

EAPEGWGPTPRAGRAAI^LKLRRRLTELLGAQ^^ 

FLPVH 
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Figure 7 

CAOUUSCATCTTAATTTGAATCC^ 

AATTCAACCCGAGTGTTTTCOUU^GATTGTA 

GAGACAGCCCTGTCTGGACACAGAGTTACrGTGGATT^ 

TTAGGAATTTCTGATTCATTTAAAGGArTTACAAATTC^ 

AAATTGAACAGGAAAAAAAAAAAGAAG 

ATGG <j ' IT TT T T A AGTCCAATATATGTT A ' 1"1 TTCT lVnTlU ' I XSGAGTCAAAGTACATTGC 

CAATATGAAACTTATCAGTGGGATGAAGACTATGACCAAGAGCa 

ACAGGATTCCCATTTCGTCAAAATGTAGACTACGGAGTTCCTTTTCATCA 

GGCTGTGTCAGTGAATGCTTCTGTCCAACTAACTT^ 

CGCAAACTCAAGACTATCCCAAATATTCCGATGCACATTCAGCAA 

AATGAAATTGAGGCTGTGACTGCAAATTCATTCT^ 

AACCTCAGCCACAACAAAATTAAATCTCAAAAGATTGATTATGGTGTGTCT 
CCAAATCTACTACAACTTCATCTAGAGCATAATAATTTAGAAGAATTTCCA 

GCTATGGATGGGCTAGTAAACTTGACCATG CTTGATCTCTGTTATAATTATCTTCATGAT 
TCTCTGCrrAAAAGACAAAATCTTTGCCAAAATGGAAAAACT 

AG TAACAGATTAGAATCAATG C CTC CTGIJT1TGC C1TCTTCACTT ATGTATLrr l>t CLTVA 

GAAAAT AATTCAATTT CTTCTATAC CCG AAAAATACTTCG ACAAACTTC CAAAACTT CAT 

ACTCTAAGAATG T CA CACAACAAACTACAAGACA TCC CAT AT AATA']"1"1"1 U 1' AATCTTC C C 

AACATTGTAGAACTCAGTGTTGGACACAACAAATO 

AATTTGGAACACCTATACCTACAAAATAATGAAATAGAAAAGAT^ 

TGTCCTTCTATTGACCCACTACATTACCACCATT^ 

AAACTAAAAGAACCAATAAGCTCATACATCTTCTTCTGCTT 

TATTATGGTGAACAACGAAGCACTAATGGTCAAACAATACAACT 

AGGAGATTTCCAGATGATGATGATGAAAGTGAAGATGACGATGA 

GAGAGCCCAGAACAAGAAGGAGCAGAAGGGCACTTTGACCTTCATTATT 

ATGAAAATCAAGA ATAG CAAGAAACTATATAGGTATA(^CTTACGACTTCA 

TACTTAATATAGTAAATCTAAGTAAACATGTATTACTCAAAGTAATATATTT 

GTATTAGTATAAGATCAGAATTGAATTTAAGTTGTTGGT 

GATTAGAACTTACTCAAAATAATGTAAATCTTTAAAAATATAAATT 

GAATCATAAATTAAACGTTAATGGTTTCTTATGCTC^ 

TAAAGAAAAAAAAAAAAAA 
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Figure 8 

MGFLS P I YVT FFFFGVKVHCQ YETYQWD EDYDQE PDDDYQTG F PFRQNVD YG VPFHQ YTL 

GCVSECFCPTNFPSSMYCDNRKLKTIPNIPMHIQQLYLQFNEIEAVTAKSFINATm 

NLSHHKIKSQKIDYGVFAKLPNLLQLHLEHNNI^^ 

AMDGLVNLTMLDLCYNYLHDSLLKDKI FAKMEKLMQLNLCSNRLESMPPGLPSSLMYIiSL 
ENNS I SSI PEKYFDKLPKLHTLRMSHNKLQDI PYNI FNLPNI VELS VGHNKLKQAFY1PR 
OT-EHLYLQNNEIEKMNLTVMCPSIDPLHYHHLTYIRVDQNiaKEPIS 
YYGEQRSTNGQTIQLKTQVFRRFPDDDDESEDHDDPDNAHESPEQEGAEGHFDLHYYENQ 

E 
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Figure 9 

CCCAAGCCAGCCGAGCCGCCAGAGCCGCGGGCCGCGGGGGTGTCGCGGGCCCAACCCCAG 
G 

ATOCTCCCCTGCGCCTCCTGCCTACCCGGGTCTCTACTGCTCTGGGCGCTC 
CTCTTGGGATCAGCTTCTCCTCAGGATTCTGA^ 

GATGGCTATGAGTGGGACCCAGACAGCCAGCACTGCCGGGATGTCAACGAG^ 

ATCCCTGAGGCCTGCAAGGGGGAAATGAAGTGCATCAACC^^ 

CTGCCCCGCTCCGCTGCCGTCATCAACGACCTACATCGCGAGGGACCCCCGCCA 

CCTCCXGCTCAACACCCCAACK:CCTGCCCACC^^ 

TGTGTGGATGTGGACGAGTGTGCCCAGGCCCTGCACGACTGTC^ 

CATAACTTGCCTGGCTCCTATCAGTGCACCTGCCCTGATGGTTACCGCAAGATCGGGCCC 
GAGTGTGTGGACATAGACGAGTGCCGCTACCGCTACTGCCAGC^ 

CCTGGCTCCTTCCGCTCCCAGTGCGAGCCGGGCTTCCAGCTGGGGCCTAACAACCGCTCC 
TGTGTTGATGTGAACGAGTG TGACATGGGGGCCCCATGCGAGCAGCGCTG CTTCAACTCC 

TGCAGTGATATTGATGAGTGTAGCTACTCCAGCTACCTCTGTCAGTACCGCTGCGT^AAC 

GAGCCAGGCCGTTTCTCCTGCCACTG CCCACAGGGTTACCAGCTGCTGGCCACACGCCTC 

TGCCAAGACATTGATGAGTGTGAGTCTGGTtSCGCACCAGTGCTCCGAGGCCCAAACC^ 

GTCAACTTCC^TGGGGGCTACCGCTGCGTGGACACCAACCGCTGCGTGGAGCCCTACAT^ 

CAGGTCTCTGAGAACCGCTGTCTCTGCCCGGCCTCCAACCCTCTATGTCGAGAGCAGCCT 

TCATCCATTGTGCACCGCTACATGACCATCACCTCGGAGCGGAGCGTGCCCGCTGACGTG 

TTCCAGATCCAGGCGACCTCCGTCTACCCCGGTGCCTACAATGCCTTTCAGATCCGTGCT 

GGAAACTCGCAGGGGGACTTTTACATTAGGCAAATCAACAAC^ 

CTCGCCCGGCCGGTGACGGGCCCCCGGGAGTACGTGCTGGACCTGGAGATGGTCACCATG 

AATTCCCTCATGAGCTACCGGGCCAGCTCTGTACTGAGGCTCACCGTCT^ 

TACACCTTCTGAGGAGCAGGAGGGAGCCACCCTCCCTGCAGCTACCCTAGCTGAGGAGCC 

TGTTGTGAGGGGCAGAATGAGAAAGGCAATAAAGGGAGAAAGAAAGTCCTGGTC 

GTGGGCGGGTCACACTGCAGGAAGCCTCAGG 

CTGGGG CAGGGTGGCACTTGGGGGGGCAGGCCAAGTTCACCTAAATGGGGGTCTCTATAT 
GTTCAGGCXCAGGGGCCCCCATTGACAGGAGCTGGGAGCTCTGCA 

ACCCCGAGAGGAGAGGAGGTAACGAGGAGGGCGGACTCCAGGCCCCGGCCCAGAGATTTG 
GACTTGGCTGGCTTGCAGGGGTCCTAAGAAACTCCACTCTGGACAGCGCCAGGAGG CCCT 
GGGTTCCATTCCTAACTCTGCCTCAAACTGTACATTTGGATAAGCCCTAGTAGTTCCCTC 
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Figure 10 

MLPCASCLPGSIJIJ^WALLIJJjL^ 

IPEACKGEMKCINHYGGYLCLPRSAAVINDLHGEGPPPPVPPAQHPNPCPPGYEPDDQDS 
CVDVDECAQALHIXrRPSQDaiNLPGSYQCTCPDG 

PGSFRCQCEPGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCH^ 
CSDIDECSYSSYLCQYRCVNEPGRFSCHCPQGYQLIATRLCQDIDECESGAHOCSEAQTC 
VNFHGGYRCVDTNRCVEPYIQVSENRCXCPASNPLCREQPSSIVHRYMTITSERSVPADV 
FQI QATSVYPGAYlU^QIRAGNSQGDFYIRQINWSAhaiVLARPWGPREYVLDL 
NSLMSYRASSVLRLTVFVGAYTF 
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Figure 11 

GGGAACGGAAA 

ATQG CGCCTCACGGCCCGGGTAGTCTTACGACCCTGGTGCCCTGGGCTGCCGCCCTGCT^ 

CTCGCTCTGGGCGTGGAAAGGGCTCTGGCGCTACCCGAGATATGC^ 

AG CGTGCAAAATTTGTCAAAAGTGGCCTTTTATTGTAAAACGACACGAGAG 

CATGCCCGTTGCTGCCTGAATCAGAAGGGCACCATCTTGGGGCTGGATCTC 

TCTCTGGAGGACCCTGGTCCAAACTTTCATCAGGCA^ 

CAAGCAAACCCCCTCAAAGGTGACTTGGCCAACACCTTCCGTCGCTTT^ 

ACTCTGATACTGCCACAACATGTCAACTGTCCTGGAGGAATT^ 

ACCTCTTATATAGACAACCAAATCTGTCAAGGGCAAAAGAACCTTTC 

GACCCAGAAATGTGTCCTGAGAATGGATCTTGTGTACCT^ 

TGTGTTTGTGCTGATGGTTTCCATGGATACAAGTGTATGCGCCAGGGCT 

CTTATGTTCTTCGGGATTCTGGGAGCCACCACTCTAT CC GTCTCCATTCTG CTTTGGG CG 

ACCCAGCGCCGAAAAGCCAAGACTTCATGAACTACATAGGTCTTAC CATTGACCTAAGAT 

CAATCTGAACTATCTTAGCCCAGTCAGGGAGCTCTGCTTCCTAGAA^ 

AGTGGATTCGCCTCAAGGTTGAGGCXGCCATTGGAAGATGAAAAATTC 

GTAGACAAATACCAGCTCCCATTGGTGTTGTTG 

AAAAAAAAAAAAAAAAAAAAA 
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Figure 12 

MAPHGPGSLTTLVPWAAAIXLALGVERAL 
HARCCLNQKGmGLDLQNCSI^PGPNFHQ^ 
TLILPQHVNCPGGINAWhnTTSYIDNQICQGQK^^ 
CVCAIXiFHGYKCMRQGSFSIXMFFGILGATTL^SILLWATQRRKAKTS 



c 



(" 
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Figure 13 

CGAGGGCTTTrCCGGCTCCGGAATGGCACATGTGGGAATC 
ATTTTTCCCTTTCCTAACAAGTTCTAACAGCTGTTCTAACA^ 
TCTTGCTGGAGAAGAAAGGGCTGAGGGCAGAGCAGGGCACTCT^ 
TCCTTGCCTCTCTGTGGATAACAGAGCATG^ 

GATGGAAGTCTAAAATAGGAAGGAATTTTGTGTG CAATATCAGACTCTGGGAGCAGTTGA 

CCTGGAGAGCCTGGGGGAGGGCCTGCCTAACAAGCTTT 

CTGGGCTGGGATAAGACGTCCCGGTAGGATAGGGAAGACTGG 

ATTGACTGGCTGGGTGAACTTCAACAGCCTTTTAACCT 

CTTAAGGGGCCAGAAATAGAGATGCTriCTAAAATAAAATT^ 

TATAGCATAAAGGCTAGAGACCAAAATAGATAACAGGATTCCCTGAACATTCCT 

GAGAAAGTATGTTAAAAATAGAAAAACCAAAATG CAGAAGGAGGAGACTCACAGAGCTAA 

ACCAGG 

ATGGGGACCCTGGGTCAGGCCAGCCTCTTTGCTCCTCCCG^ 

CACTCTG CCTTGTGT'l' 1TGCAGAATCATGTGAGGGCCAACCGGGGAAGGTGGAGCAGATG 
AGCACACACAGGAGCCGTCTCCTCACCGCCGCCCCTCTCAGCATGGAACAGAGGCAGCCC 
TGGCCCCGGGCCCTGGAGGTGGACAGCCGCTCTGTGGTCCTGCTCTCAGTGGTCTGGGTG 
CTGCTGGCCCCCCCAGCAGCCGGCATGCCTCAGTT^ 

GACTGGACCTTCAACCACTTGACCGTCCACCAAGGGACGGGGGCCGTCTATGTGGGGGCC 

ATCAACCGGGTCTATAAGCTGACAGGCAACCTGACCATCCAGGTGGCTCATAAGACAGGG 

CCAGAAGAGGACAAC^GTCTCGTTACCCGCCCCTCATCGTGCAGCCCTGCAGCG 

CTCACCCTCACCAACAATGTCAACAAGCTGCTCATCATTGACT 

CTGGCCTGTGGGAGCCTCTACCAGGGGGTCTGCAAGC^ 

ATCCTGGTGGAGCCATCCCACAAGAAGGAGCACTACCTGTCCAGTGTC^ 

ACCATGTACGGGGTGATTGTGCGCTCTGAGGGTGAGGAT 

GCTGTGGATGGG^GCAGGATTACTTCCCGACCCT^ 

CCTGAGTCCTCAGCCATGCTCGACTATGAGCTACACAGCGATTTTGTCTCCT 

AAGATCCCTTCAGACACCCTGGCCCTGGTCrrCCCACTTTGACATC 

TTTGCTAGTGGGGGCTTTGTCTACTTTCTCACTGTCCAGCCCGAGACCCCTC 

GCCATCAACTCCGCTGGAGACCTCTTCTACACCTCACGCATCGTGCGGCTCTGCAAGGAT 

GACCCCAAGTTCCACTCATACGTGTCCCTGCCCTTCGGCTGCACCCGGGCCGGGGTGGAA 

TACCGCCTCCTGCAGGCTGCTTACCTGGCCAAGCCTGGGGACTCACT 

AATATCACCAGCCAGGACGATGTACTCTTTGCCATCTTCTCCAAAGGGCAGAA 

CACCACCCGCCCGATGACTCTGCCCTGTGTGCCTTCCCTATCCGGGCCATCAACTTGCAG 

ATCAAGGAGCGCCTGCAGTCCTGCTACCAGGGCGAGGGCAACCTGGAGCTCAACTGGCTG 

CTGGGGAAGGACGTCCAGTGCACGAAGGCGCCTGTCCCCATCGATGATAACTTCTGTGGA 

CTGGACATCAACCAGCCCCTGGGAGGCTCAACTCCAGTGGAGGGCCT 

ACCAGCAGGGACCGCATGACCTCTGTGGCCTCCTACGTTTACAACGGCTACAGCGTGGCT 

TTTGTGGGGACTAAGAGTGGCAAGCTGAAAAAGGTAAGAGTCT^ 

AATGCCATTCACCTCCTCAGCAAAGAGTCCCT^ 

GATTTAACTATAGGCAACTTTATTTTCTTGGGGAACAAAGGTC 

GGGGTTAATTTTGTGAC1TAGCTTCTAGCTACTTCCT 

TAAGGAATGCAAGIOTATTTCAATATTTCCCA^^ 

CATCTGCAAAAGCAAA 
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Figure 14 

MGTI^QASLFAPPGNYFWSDHSALCFAESCEGQPGKVEQMSTHRSRLLTAAPLSMEQROP 
WPRALETOSRSVVU^VVWVIJAPPAAGMPQF^ 

INRVYKLTGNLTI QVAHKTG PEEDNKSRYPPLIVQPCSEVLTLTNNVNKLLI IDYSENRL 
IJVCGSLYQGVCKLLRLDDLF ILVEPSHKKEHYLSSVNKTC GT 
AVDGKQDYFPTLSSRKLPRD PESSAMLDYELHS DFVSSL I KI PSDTLALVSHFDI FYX YG 
FASGGFVYFLTVQPETPEGVAINSJIGDLFYTSRIVRLCKDDPKFHSYVSLPFGCTRAGVE 
YRLLQAAYLAKPGDSLAQAFNI TSODDVLFAI FSKGQKQYHHPPDDSALCAFPI RAINLQ 
I KERLQSCYOGEGm*ELNWLMKDVQCTKAPVP IDDNFCGLD INQPLGGSTPVEGLTLYT 
TSRDRMTSVASYVYNGYSVVFVGTKSGKLKKTOV^ 
NYRQLYFLGEQR 
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Figure 15 



CCCAGAAGTTCAAGGGCCCCCGGCCTCCTGCGCTCCTGCCGC CGGGACCCTCGACCTCCT 

CAGAGCAGCCGGCTGCCGCCCCGGGAAG 

ATGGCGAGGAGGAGCCGCCACCGCCTCCTCCTGCTGCTGCTC 

CTGGGCTATCATAAGGCCTATGGGTTTTCTGCC^ 

GTAGAGTACCAAGAGGCTATTTTAGCCTGCAAAACCCCAAAG 



GGTGATTTTAAAAATCGAGCTGAGATGATAGATTTCAAT^ 

AGAAGTGATGCGGGGAAATATCGTTGTGAAGTTAGTGCCCCATCTGAGCAAGG 

CTGGAAGAGG&TACAGTCACTCTGGAAGTATTAGTGGCTC 

GTACCCTCTTCTGCTCTGAGTGGAACTGTGGTAGAGCTACGAT^ 

AATCCAGCTCCTGAATACACATGGTTTAAGGATGGCATC 

CTTGGCTCCCAAAGCACCAACAGCTCATACACAATGfcAATACAA^ 

TTTAATACTGTTTCCAAACTGGACACTGGAGAATATTCCIX3TGAAGC^ 

GGATATCGCAGGTGTCCTGGGAAACGAATGCAAGTAGATGATCTCAACATAAGTGGCATC 

GCTCAGAGGJUIA^GCTACTTTTCAAAAGAAACCTCCTTCCAG 
AAGAGTAATTCTTCATCTAAAGCCACGACAATGAGTGAAAAT^ 

GTAATCCCAGCACTTTGGAAGGCCGCGGCGGGCGGATCACGAGGTCAGGAGTTCTAGACC 

AGTCTGGCCAATATGGTGAAACCCCATCTCTACTAAAATACAAAAATTAGCTC 

TGGCATGTG CCTGCAGTTCCAGCTGCTTGGGAGACAGGAGAATCACTTGAACCCGGGAGG 

CGGAGGTTGCAGTGAGCTGAGATCACGCCACTGCAGTCCAGCCTGGGTAACAGAGCAAGA 

TTCCATCTCAAAAAATAAAATAAATAAATAAATAAATACTGGCT 

TTACAATAAATATAGCTTGATATTC 
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Figure 16 

MARRSRHRLllJXLRYLVVALGra 
UEWKIOGRSVSFVYYQQTI^GDFK^^ 

LEEDTVTIJEVLVAPAVPSCTVPSSAI^CTVVELRCQDKEGNPAPEYTWFKM 
LGSQSTNSSYTMhnTKTGTLQF>n^ 

1AAVVWALV1SVCGLGVCYAQRKGYFSKETSFQKSNSSSKATTMSEKVQWLTPVIPALW 
KAAAGGSRGQEF 



16/133 



(C) 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/USOO/05841 



Figure 17 



TGGGGGCCCCCCAGGCTCGCGCGTGGAGCGAAGCAGCATC 



GGGGTGTTCGAACTGAAGCTGCAG^ 

AATTGCTGCCGCGGGGGCGCGGGGCCACCGCCGTGCGCCTGCOGGACCTTCTTC 

TGCCTCAAGCACTACCAGGCCAGCGTGTCCCCCGAGCCGCCCTGCACCT^ 

GTCACCCCCGTGCTGGGCGTCGACTCCTT»GTCTGC 

GCGTTCAGCAACCCCATCCGCITCCCCTTCG^ 

ATTATTGAAGCTCTCCACACAGATTCTCCTG^ 

CTCATCAGCCGCCTGGCCACCCAGAGGCACCTGACGGTGGGCGAGGAGTGGTCCC^ 

CTGCACAGCAGCGGCCG<^CGGACCT(^GTACTC 

TACTACGGAGAGGGCTGCTCCGTTTTCTGCCGTCCCCGGGACC^ 

ACCTGTGGGGAGCGTGGGGAGAAAGTGTGCAACCCTGGCTGGAAAGGGCCCT 

GAGCCGATCTGCCTGCCTGGATGTGATGAGCAGCATGGATTT^ 

TGCAAGTGCAGAGTGGGCTGGCAGGGCCGGTACTGTGACGAGTGTATCCGCTATCCA 

TGTCTCCATGGC^CCTGCCAGCAGCCCTGGCAGTGCAACTG^ 

CTTTTCTGCAACCAGGACCTGAACTACTGCACACACCA^ 

ACCTGCACCAACACGGGCCAGGGGAGCTACACTTGCTCTTGCCGGCCTGGGTACACAGCT 
GCCACCTGCG^CTGGGGATTGACGAGTGTGACCCCAGCCCTTC 

TCCACX3GATCTCGAGAACAGCTACTCCTGTACCTGCCCACC CGGCTTCTACGG CAAAATC 
TGTGAATTGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTCGGTGCTCA 
GACAGCrcCGATGGAGGGTACAGCTGCCGCTGCCCCGTGGGCTACTCCGGCTTCAACTCT 
GAGAAGAAAATTGACTACTGCAGCTCTTCACCCTGTTCTAATC 

CTCGGTGATGCCTACCTGTGCCGCTGCCAGGCCGGCTTCTCGGGGAGG CACTGTGAOGAC 

AACGTCGACGACTGCGCCTCCTCCCCGTGCGCCAACGGGGGCACCTGCCGGGATGGOGTG 

AACGACTTCTCCTCCACCrGCCCGCCTGGCTACACGGGCAGGAACTGCAGTGCCCCOT 

AGCAGGTGCGAGCACGCACCCTGCC^CAATGGGGCCACCTGCCACGAGAGGGGCCACCGC 

TATGTGTGCGAGTGTGCCCGAGGCTACGGGGGTCCCAACTGCCAGTTCCTG^ 

CTGCCCCCGGGCCCAGCGGTGGTGGACCTCACTGAGAAGCTAGAGGGCCAGGGCGGGCCA 



GCCGCTGTGGTGGTCTGCGTCCGGCTGAGGCTGCAGAAGCACCGGCCCCCAGCCGACCCC 

TGCCGGGGGGAGACGGAGACCATGAACAACCTGGCCAACTGCCAGCGTGAGAAGGACATC 

TCAGTCAGCATCATCGGGGCCACGCAGATCAAGAACACCAACAAGAAGGCGGACTTCCAC 

GGGGACCACAGCGCCGACAAGAATGGCTTCAAGGCCCGCTACCCAGCGGTGGACTATAAC 

CTCGTGCAGGACCTCAAGGGTGACGACACCGCCGTCAGGGACGCGCACAGCAAGCGTGAC 

ACCAAGTGCCAGCCCCAGGGCTCCTCAGGGGAGGAGAAGGGGACCCCGACCACACTCAGG 

GGTGGAGAAGCATCTGAAAGAAAAAGGCCGGACTCGGGCTGTTCAACTTCAAAAGA 

AAGTACCAGTCGGTGTACGTCATATCCGAGGAGAAGGATGAGTGCGTCATAGO^CTGAG 

GTGTAAAATGGAAGTGAGATGGCAAGACTCCCGTTTCTCTTAAAATAAGTAAAAT^ 

GGATATATGCCCCAACGAATGCTGCTGAAGAGGAGGGAGG 

CCTCGTCGACTGCTGCTGAGAAACCGAGTTCAGACCGAGCAGGTTCTCCTCCTGAGGTCC 

TCGACGCCTGCCGACAGCCTGTCGCGGCCCGGCCGCCTGCGGCACTGCCTTCCGTGACGT 

CGCCGTTGGACTATGGACAGTTGCTCTTAAGAGAATATATATTT^ 

TTACGCATAAGAAGCATGCACTGCCTGAGTGTATATTTTGGATC 

TTCTnSAATTAGAAACACAAACACTGCCTTTATTGTC 

TTT CTAGATGGAAAAGATGTGTGTTATTTTTTGGATTTGTAAAAAT^ GATAT 
CTGTAAAGCTTGAGTAi 11 iGTGATGTTCGT'l"l"I , TTATAATTTAAAT , l ,, l"lGGTAAATATG 
TACAAAGGCACTTCGGGTCTATGTGACTATATTTTTTTGTATATAAATGTATTTATG^ 
TATTGTGCAAATGTTATTTGAG1T7 rTTACTGU - ITlGTTAATGAAGAAATTC CTTTTTAA 
AATATTTTTCCAAAATAAATTTTATGAATGACAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



CCTGGCGGTGCTCTCGGI 



iGGTCTGGAGCTCT 




'CCTTGTCCTCATGCTGCTGCTGGGCTGT 
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Figure 18 

MGSRC&IAIAVLSAI^CQWSSGVFEIJCLQEF^ 

FRVCLKHYQASVSPEPPCTYGSAVTPVliGVDSFSLPDGGGADSAFSNPIRFPFGFTVrPG 
TFSLII EALHTDS PDDLATENPERLI SRIATQRHLTVGEEWSQDLHS SGRTDLKYS YRF 
VCDEHYYGEGCSVFCRPRPDAFGHFTCGERGEKVCNPGWXGPYCTEPICLPGCDEQHGF 
CDKPGECKCRVGWQGRYCDECIRYPGOJJGTCQQ^^ 

KPCraJGATCTNTCQGSYTCSCRPGYTGATCELGI DECDPSPCKNGGS CTDLENS YSCTC 
P PGFYGKI CELSAMTCADGPCFNGGRCSDSPDGGYS CRCPVGYSGFNCEKKI DYCS SS P 
CSNGAKCVDLGDAYLCRCQAGFSGRHCDDNVDDCAS S PCANGGTCRDGVNDFSCTCPPG 
YTGRNCSAPVSRCEHAPCHNGATCHERGHRYVCECARGYGGPNCQFI^PELPPGPAVVP 
LTEKLEGQGGPFPWVAVCAGVILVLMLLLGCAAVWC^ 

NKLANCQREKDI S VS 1 1 GATQ I KNTNKKADFHGDB SAD KNGFKARYPAVDYNLVQDLKG 
DDTAVRDAHS KRDTKCQPQGS SGE EKGTPTTLRGGEAS ERKRPDS GC S TSKDTKYQS VY 
VISEEKDECVTATEV 
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Figure 19 

GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGG 
CGCAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGA 

ATCTCCCTGCTCCCACGCCGCGCCCCTCCGGTCAGCATGAGGCTCCTGGCGGCCGCGCTG 

CTCCTGCTG CTGCTGG CGCTGTACACCGCGCGTGTGGACGGG TCCAAATGCAAGTGCTCC 

CGGAAGGGACCCAAGATCCGCTACAGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTAC 

CCGCACTGCGAGGAGAAGATGGTTATCATCACCACCAAGAGCGTGT 

CAGGAGCkCTGCCTGCACCCCAAGCTGCAGAGGACC^ 

GCCTGGAACGAGAAGCGC^GGGTCTACGAAGAATAGGGTGAAAAA 

CTCCAAACCAGTTGGGAGACTTGTGCAAAGGACTTTGCAGAT^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAGCCTTTCTTTCT 

ATATATTGTTATGAAGCACTTTTTACCAACGGTCAGTTT^ 

GAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCTCCAGAC 

TTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACT 

TG CTTTTTTGTATTTGTCCATACGTCACTATACATCTGAG CTTTATAAGCG CCCGGGAGG 
AACAATCAGCTTGGTGGACACATTTCATTGCAGTGTTGCT 

TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGC 
CGGTCACAGCCTCAGTGTGACTCCACAGTGGTC^ 

CTCTCTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTC 

CCCCCTGGTTAATTTTTACACACCCTAGGAAACATTTCCAAGA 

AAATGATCCTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAA 

AGGTTCAATATTTAATGCTTCAGAAGCATGTGAGGTTCCO^CACTG 

TAGGAGAAAACTTAAAAATATATGAATACATGCGCAATACACAGCTACAGA 

TGTTGACAAGGGAAAACCTTCAAAGCATGTTTCTTC 

TACTAAAGCAATATATTTGTGATTCC C CATGTAATTCTTCAATGTTAAACAGTGCAGTCC 
TCTTTCGAAAG CTAAGATGAC CATGCG CCCTTTCCTCTGTACATATACCCTTAAGAACGC 
CCCCTCCACACACTGCCCCCCAGTATATGCCGCATTG 

ACATGTCAGAAACCATTAG CATTG CATGCAGGTTTCATATTCTTTCTAAGATGGAAAGTA 
ATAAAATATATTTGAAATGTAAAAAAAAAAAAAAA 
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Figure 20 

MSULPRRAPPVSMRU^AAL^ 

PHCEEKNTVinTKSVSRYRGQEHClJiPKl^STKlU^KWYNAWNEKRRVYEE 
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Figure 21 

GGAGAGCGGAGCGAAGCTGGATAACAGGGGACCG 

ATGATGTGGCGACCATCAGITCTGCIGCIT^ 

AAGCCATCCCCAGACGCAGG<X:CT^ 

AGCGACGCTCCCCATGATGACGCCCACGGGAACTTCCACT 

GGACGGGAAGTX^CCAAGGAATTCGACCAACTCACCCCAGAOGA 

GGGCGGATCGTGGACCGCATGGACCGCGCGGGGGACGGCGACGGCTGGGTGTCGCTGGCC 

GAGCTIWCGCGTGGATCGCGCACACGCAGCM 

GCCTGGGA»CGTACGACACGGACCGC^ 

GCCACCTATGGCCACTACGCGCCCGGTGAAGAATTTCATGACGTGGAGGATGCAGAGACC 
TACAAAAAGATGCTGGCTOGGGACGAGCGGCGTTTCCGGGTGGCCGACCAGGATGGGGAC 
TCGATGGCCACTCGAGAGGAGCTGACAGCCTTCC^ 

CGGGACATCGTGATTGCTGAAACCCTGGAGGACCTGGACAGAAACAAAGATGG 

CAGGTGGAGGAGTACATCGCGGATCTGTACTCAGCCGAGCCTGGGGAGGAGGAGCCGGCG 

TGGGTGCAGACGGAGAGGCAGCAGTTCCGGGACTTCCGGGATCTGAACAAGGAT^ 

CTGGATGGGAGTGAGGTGGGCCACTGGGTGCTGCCCCCTGCCCAGGACCAGCCCCTGGTC 

GAAGCCAACCACCTGCTGCACGAGAGCGACACGGACAAGGATGGGCGGCTGAGCAAAGCG 

GAAATCCTGGG TAATTGGAA CATGTTTGTGGG CAGTCAGGC CA C CAACTATGG CGAG GA C 

CTGACCaSGCACGATGATGAGCTGTOAGCAC^^ 

CACAATGACCGGAGGAGGGGCCGCTGTGGTCTGGCCCCCTCCCTGTCCAGGCCCCGCAGG 
AGGCAGATGCAGTCCCAGGCATCCTCCTGCCCCTGGGCTCTCAGGGACCCCCTGGGTCGG 
CTTCTGTCCCTGTCACACCCCCAACCrCAGGGAGGGGCT 

CAATACCTATTTCTGACTGAGTCTCCCAGCCCAGACCCAGGGACCCTTGGCCCCAAGCTC 
AGCTCTAAGAACCGCCCCAACCCCTCCAGCTCCAAATCT 

AAACTCCCCTGGCCCCAGCCCTCTCCTGCCTGGCCTGGCCTGGGACACCTCCTCTCTGCC 
AGGAGGCAATAAAAG CCAGCGCCGGGAC CTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 22 

HMWRPSVLLIXLIiLRHGAQGCT^ 

GREVAKETDQLTPEESOAIU^IVDRMDRAGDCDGWVSIA^ 

AWDTYDTDRTCRVGWEELRNATYGHyAPGEEFHDVEDAETYKK 

SMATREELTAFI^PEEFPHMRDIVIAETLEDLDRNKDGYVQVEEYI 

WVQTERCXJFRDFRDLNKDGHLDGSEVGHWV^ 

EILGNWNMFVGSQATNYGEDLTRHHDEI* 
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Figure 23 

CAAAACTTGCXTIXXSCGGAGAGCGCCCAGCTTGAC^ 

GAGCGCAGCTGA^CTGGGGGAGCGCGTrCGGCCTGTGGGGCGCCGCTOGGCGCOGGGGC 
GCAGCAGGGAAGGGCAAGCTCT G GTCTGCCCTGCTCCACGAGGOGCCACT G G^ 
GGGAGAGCCCCTOGOTGGTCCCGTCCCCTATCLC 1'L C ITiATATAGAAACCTTCCACACT 
GGGAAGGCAGCGGCGAGGCAGGAGGGCTCATGGTGAGCAAGGAGGCCGGCTGATCT 
GCG CACAGCATTCCGAGTTTACAGATTTTTACAGATACCAA 

ATGGAAGGCGAGGAGGCAGAACAGCCTG CCTGGTTCCATCAGCCCTGG CGCCCAGGCG CA 
TCIX^CTCGGCACCCCCTGCAGGCACCaTGGCCCAGAGCCG 



CGAAGTGGCGGCCACAGCCTGAGCCCCGAAGAGAACGAATTTGOGGAGGAGGAGCCGGTG 
CTGGTACTGAGCCCTGAQGAGCCCGGGCCT^ 

GGGGACCTrcCTGAGCACACCAACCACCTATCT^ 
TACCCTGAGGAGCTCTCCCGGCTGCACCGGCTGGAGACACTGAACCTC 
CTGACTTCCCGAGGGCTCCCAGAGAAGG Cta T J 1 ' bA GCATCTGACCAACCTCAATTACCTG 
TA CTTGGCCAATAACAAGCniACCTTGGCACCCCG CTTCCTGCCAAACGCCCTGATCAGT 
GTGGACTTTGCTGCCAACTATCTCACCAAGATCTATGGGCTC^ 

AACTTGAGGTCTGTGTACCTGCACAACAACAAGCTGGCAGACGCCGGGCTGCCGGACAAC 
ATGTTCAACGGCTCCAGCAACGTCGAGCTCCTCATCCTGTCCAGCAACTTCCTC 
GTGCCCAAGCACCTGCCGCCTGCCCTGTACAAGCTGCACCTCAAGAACAACAAGCTGGAG 
AAGATCCCCCCGGGGGCCTTCAGCGAGCTGAGCAGCCTGCGCGAGCTATACCTGCAGAAC 
AACTACCTGACTGACGAGGGCCTGGACAACGAGACCTTC7 GGAAGCTCTCCAGCCTGGAG 

GTGCTGCTGCACTTGGAGAAGAACGCCATCCGGAGCGTGGACGCGAATGTGCTGACCCC^ 
ATCCGCAGCCTGGAGTACCTGCTGCTGCACAGCAACCAGCTGCGGGAGCAGGGCATCCAC 
CCACTGGCCTTCCAGGGCCTCAAGCGGTTGCACACGGT^^ 
GAGCGCGTGCCCAGTGGCCTGCCTCGCCGCGTGOT 

ATCACAGGCATTGGCCGCGAAGACTTTGCCACCACCTACTTCCTGGAGGAGCTCAACCT 
AGCTACAACCGCATCACOU3CCCA<^GGTGCACCGC^ 

CTGCGCTCG CTGGACCTGTCGGGCAACCGGCTGCACACteCTGCCACCTGGGCTC 

GCGGGCATGGCTCAGCTGCGTGAGCTGTACCTCACCAGCAACCGACTC 

CAGCTCACAGAGATCCCCGAGGGGCTCCCCGAGTCACTTGAGTACCTGTACCTGCAGAAC 
AACAAGATTAGTGCGGTGCCCGCCAATGCCTTCGACTCC^ 

TTTCT CAGGTTTAACAAGCTGGCTGTGGGCTCCGTGGTGGACAGTGCCTTCCGGAGGCTG 

AAGCACCTGCAGGTCTTGGACATTGAAGGCAACTTAGAGTTTGGT^ 

CGTGG CCGCTTGG GGAAGGAAAAGGAGGAGGAG GAAGAGGAGGAGGAGGAGGAAGAGGAA 

ACAAGATAQTGACAAGGTGATGCAGATGTGACCTAGGATGA 

CTGCAGCACACG C C TGT G TGCTGTGAGCCCCCCACTCTGCCGTGCTCACACAGACACACC 

CAGCTGCACACATGAGGCATCCCACATGAC^CGGGCTGACACAGTCTCATATC^ 

CTTCCCACGGCGTGTCCCACGGCCAGACMATGCA^ 



CTGCCACGCCCTCTGAATCATGCAGGGAAGGGTCT^ 

CCCATTCCCTCCCCCTGCTGACATGTGTATGCGTATGCATACACACCACACACACACACA 
TGCACAAGTCATGTGCGAAC^GCCCTCCAAAGCCTATGCCACAGACAGCT 
CCAGAATCAGCCATAGCAGCTCGCCGTCTGCCCTGTCCATCTGTCCGTCCGTTCCCTGG^ 
GAAGACACAAGGGTATCCATGCTCTGrrGGCCAGGTGCCTC 

AAG CTGG CTTTTATTCCTTTCC CATCCTATGGGGA CAGGAGCCTTCAGGACTGCTGGCCT 

GGCCTGGCCCACCCTGCTCCTCCAGGTGCTGGGCAGTCACTCTGCTAAGAGTCCCTCCCT 

GCCACGCCCTGGCAGGACACAGGCAL 1 TTTCCAATGGGCAAGCCCAGTGGAGGCAGGATG 

GGAGAGCCCCCTtXSGTGCTGCTGGGGCCTTGGGGCAGGAGTGAAG^ 

TGGGCTGAGCCAGGGAGGAAGGACCGAGCTGGACCTAGGAGACACCTTTGTTCTTC^ 

CTGTGGGGGAAGT rCCGGGTGCCTTTATT TTTI ATl'LTriTCIAAGGAAAAAAATGATAA 

AAATCTCAAAGCTGATTTTTCTTGTTATAGAAAAACTAATATAAAAGCA 
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Figure 24 

MEGEEAEQPAWFHQPWRPGASDSAPPAGTMAQSRVLLLIALLPPQLHI^ 

RSGGHS LS PEENEFABEEPVLVLS PEEPGPGPAAVSCPRDCACSQEGWDCGG IDLREFP 

GDLPEHTNHLSLQNNQLEKI YPE EILSRIJIRLETLNLONNRLTSRGLPEKAFEHLTNLNYL 

YTtAlWKLTIAPRFLPNALISVDFAANYLTKIY 

MFWGSSNVEVLILSSNFLRHVPIOiLPPALYKLm^ 

NYLTDEGLDNETFWKLSSLEYLDLS SNOTjSRVPAGLPRSLVIJjHLEKNAIRSVDANVLTP 
IRSLEYIXLHSNQLREC^IHPLAFQGLKRIJrrVHLY^ 

ITGI GREDPATTYFLEELNLS YKRITSPQVHRDAFRKLRLLRSLDLSGNRLHTLPPGLPR 
NVHVLKVKRNEIiAAIARGALAGMAQLRELYLTSl^ 

QLTEIPEGLPESLEYLYLQinnCISAVPAKAFDSTPNLKGIFIJIFNKIAVGSVTO 
KHLQVXJDI EGNLEFGDISKDRGRLGKEKEEEEEEEEEEEETR 
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Figure 25 

GGCGCCGGTGCACCGGGOGGGCTGJ^CGCCTCCTGCGGCCCGGCCTGCGCGCCCOGGCCC 

GCCGCGCCGCCC^CGCCCCAACCCCXX^CCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGC 

CCGCGCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCC^^ 

GCCCCGCCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTG^ 

CATAAAACATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGC CAGTCCGCGCCGCC 

GCCGCCCTCGCCCTGTGCGCCCTGCGCGCCCT 

GCCGGGCGGAGCGGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAG 

CGGCGGCGCCTGGATGCGGACCCGGCCGCGGGGAGACGGGCGCCCG CCCCGAAACGACTT 

TCAGTCCCCGACGCGCCCCGCCCAACCCCTACG 

ATGAAGAGGGCGTCCGCTGGAGGGAGCC^ 

TGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATC 

ACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTGCT 

CGCATCITCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCC^ 

CGCAACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGCGGCTGCC 

TTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCCGGTCT 

GTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGCTGC 

GGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCT 

CTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCT 

ACACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGOT 

GCCTTCCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAA 

CTGCCCACTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCT 

CCCTGGGTGTGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGC 

TCCTCCTCCGAGGTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGC 

CTAGCTGCCAATGACCTGCAGGGCTGCGCTGTGGCCACCGGCCCTC 

ACCGGCAGGGCCACCGATGAGGAGCCGCTGGGGCTTCCCAAGTGCTG 

GCTGAGAAGGCCTCAGTACTGGAGCCTGGAAGACCAGCTTCGGCAGGCAATGCGCTGAAG 

GGACGCGTXXTCGCCCGGTGACAGCCCGCCGGGCAACGGCT 

GACTCACCCTTTGGGACTCTGCCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCC 

GAGGGCTCCGAGCCACCAGGGTTCCCCACCTCGGGCCCTCGCCGGAGGCCAGGCTGTT^ 

CGCAAGAACCGCACCCGCAGCCACTGCCGTCTGGGCCAGGCAGGCAGCGGGGGTGGCGGG 

ACTGGTGACTCAGAAGGCTCAGGTGCCCTACCCAGCCTCACCTGCAGCCTCACCCCCCTG 

GGCCTGGCGCTGGTGCTGTGGACAGTGCTTGGGCCCTC 

GCGTGCTCAGCAGCCAGGTGTGTGTACATACGGGGTCT 

CGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAGGTCCTCCCTGATGGACGCCTGCCGCCC 
GCCACCCCCATCTCCACCCCATCATGTTTACAGGGTTCGGCGGCAGC Gl ' lTG TTCCAGAA 
CGCCGCCTCCCACCCAGATCGCGGTATATAGAGATATO 
AATATCGGACGACGTGGAATAAAGAGCTC TT TTCTTA AAAAAA 
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Figure 26 

MKR^AGGSRLIJVWVLWI^AWQ 

QRI FLHGNRI SHVPAASFRACRNLTI LWLHSNVLARICAAAFTGLALLEQLDLSDKAQL 
RSVDPATFHGLGRLHTLHLDRCGLQELG PGLFRGLAALQYLYLQDNAWALPDDTFRDL 
GNLTHL FLHGNRI SSVPBRAFRGLHSLDRIJjLHQNRVAHVHPHAFRDIiGRLMTLYLFAN 
NLSALPTEAIJ^LRAI^YLRIJJDNPWCDC^ 
RDLKRIJU^NDLQGCAVATGPYHPIWTGRATDEE 

AGNALXGRVPPGDSPPGNGSGPRHINDSPFGTI»PGSAEPPLTAVRPEGSEPPGFPTSGP 
RRRPGCSRKTOTRSHCRIiGOAGSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGP 
C 
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Figure 27 

GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTTCACCCCGGTCTGCGTC 

AIXjI^IJ^AACTCCAATGTCCTCCTCTGGT^^ 

GACAGCCAAGCACAGTATCCAGTTGTCAACACAA^ 

ACACCGTTACCCAATGAGATCTTGGGTCCAGTGGAGCAGT^ 

TCACCCCCCACTGGAGAGAGGCGGTTTO^ 

ATCCGAAATACTACTCAGTTTCCTGCTGTGTGCC CCCAGCAC CTGGATGAGAGATCCTTA 

CTGCATGACATGCTGCCCATCTGGTTTACCGCCAACT 

CAAGATCAAAA.TGAAGACTGCCTTTACTTAAACATCTACGTC 

AACACAAAGAAAAACGCAGATGATATAACGAGTAATGACCGTGGTGAAGACGAAGATArr 
CATGATCAGAACAGTAAGAAGCCCGTCATGGTCTATATCCATGGGG 
GGCACCGGCAAC^TGATTGACGGC^GCATTTTGGC^ 
ACCATTAACTACCGTCTGGGAATACTAGGGTTTITAAGTACC^ 

GGCAACTATGGGCTCCTGGATCAGATTCAAGCACTGCGGTGGATTGAGGAGAATGTGG^ 
G CCTTTGGCGGGGACCCCAAGAGAG TGACCATCnTTGGCTCGGGGGCTGGGGCCTCCTGT 
GTCAGCCTGTTGACCCTGTCCCACTACT^ 

AGCGGCACCGCCCTGTCCAGCTGGGCAGTGAACTACCAGCCGGCCAAGTACACTCGGATA 

TTGGCAGACAAGGTCGGCTGCAACATGCTGGACACCACGGACATGGTAGAATC 

AACAAGAACTACAAGGAGCTCATCCAGCAGACCATCACCCCGGCCACCTACCACATAGCC 

TTCGGGCCGGTGATCGACGGCGACGTCATCCCAGACGACCCCCAGATCCTGATGGAGCAA 

GGCGAGTTCCTCAACTACGACATCATGCTGGGCGTCAACCAAGGGGAAGGCCTGAAGTTC 

GTGGACGGCATCGTGGATAACGAGGACGGTGTGACGCCCAACGACTTTGACTTCTCCGTG 

TCCA^CTTCGTGGACAACCTTTACGGCTACCCTGAAGGGAAAGACACTTTC 

ATCAAGTTCATGTACACAGACTGGGCCGATAAGGAAAACCCGGAGACGCGGCGGAAAACC 

CTGGTGGCTCTCTTTACTGACCACCAGTGGGTGGCCCCCGCCGTGGCCGCCGACCTGCAC 

GCGCAGTACGGCTCCCCCACCTACTTCTATGCCTTCTATCATCACTGCCAAAGCGAAATG 

AAGCCCAGCTGGGCAGATTCGGCCCATGGTGATGAGGTCCCCTATGTCTTCGGCATCCCC 

ATGATCGGTCCCACCGAGCTCTTCAGTTGTAACTTTTCCAAGAACGA 

G CCGTGGTCATGAC CTACTGGACGAACTTCGCCAAAACTGGTGATCCAAAT CAACCAGTT 

CCTCAGGATACCAAGTTCATTaCACAAAACCCAACCGCTTTGAAGAA 

aagtataatcco^gaccagctctatctgcatattggcttgaaaccca 

cactaccgggcaacgaaagtggctttctggttg^ 

aacgagatattccagtatgtttcaacaaccacaaaggt^ 

tttccctatggcacccggcgatctcccgccaagatatggccaaccaccaaacgcccagca 

atcactcctgccaacaatcccaaacactctaaggaccctcacaaaacagggcctc 

acaactgtcctcattga^ccaaacgagattattccaccgaattaagtgtcaccattgcc 

gtcggggcgtcgctcctcttcctcaacatcttagcttttc 

gacaagaggcgcgatgagactcacaggcgccccagtccccagagaaacaccacaaatgat 

atcgctcacatccagaacgaagagatcatgtctctgcagatgam 

cacgagtgtgagtcgctgcaggcacacgacacactgaggctcacctgcccgccagactac 

accctcacgctgcgccggtcgccagatgacatcccacttatgacgccaaacaccatcacc 

atgattccaaacacactgacggggatgcagcctrixj cacacttttaacaccttcagtgga 

ggacaaaacagtacaaatttaccccacggacattccac^ 

atttcccttcctatccctctgccctacccgctcagcaacatag^ 

agaaggaaagagagagagaaagaaagtctccagaccaggaatg ttttt gt^ 

taagacaaaaatgcaaaaaggcagtcatcccatccx:g 

TCCAGTATTACAAGATCAACTTCTGACCCTGTGAAATGTGAGAAGTACACATTTCT 

AAATAACTGCTTTAAGAT LTl" LTA C CA CTC CAATCAATGTTTAG tgtgataggacatcac c 

ATTTCAAGGCCCCGGGTGTTTCCAACGTCATGGAAGCAGCT 

CAAGGACACTTGATATTTTTTAATTACAATGGAAGTT^^ 

CAATCGATGGCTCTCCTTAAGTGAAGAAAGAGTCAATGAGATTTTGCCtt 

TGTAATCCAGAGAGAAGGAAACGTAGAAATTTATT 

TGT 
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Figure 28 

MLNSNVLLWLTALAI KFTL I DSQAQYPWNTNYGKI RGLRTPLPNEILGPVEQYLGVPYA 

S P PTGERRFQPPEPPS SWTGIRNTTQFAAVCPQHIJ>ERSIJJHDML PIWFTAI^DTLMTW 

QDQNEDCLYl^IYVPTEIXSANTKKNADDITSNDRGEDEDIHDQNSKI^ 

GTGNM I DGS ILASYGNVI VI TINYRLG I LGFLSTGDQAAKGNYGLLDQI QALRW I EENVG 

AFGGDPKRVT I FGSGAGAS CVS LLTLS HY S EGLFQKAI I QS GTALSS WAVNYQP AKYTRI 

IJ^KVGCNMLDTTDMVECLRNKNYKEL I QQTI TPATYHI AFGP VTDGDVI PDDPOILMEQ 

GEFLNYDIMLGVNQGEGLIQFVDGrVDNEDGVTPNDFDFSVSNFVDl^YGY 

IKFMYTDWADKENPETRRKTLVALFTDH^^ 

KPSWADSAHGDEVPYVFG I PMI GPTELFSCNFSIO&DVMLSAVVMTYWTNFAKTGDPNQFV 
PQDTKFIHTKPNRFEEVAWSKYNPKDQLYLHIGLKPRVRJDH^ 

NEI FQYVSTTTKVPPPDMTS FP YGTRRSPAKIWPTTKRPAITPANNPKHSCTPHKTGPED 
TTVLI ETKRDYSTELSVTI AVGASLLFLNI LAFAALYYKKDKRRHETHRRPSPQRNTTND 
I AHI QNEE I MS LQMKQLEHDHE CES LQAHCTLRLTC PPDYTLTLRRS PDD I PLMT PNTI T 
MI PNTLTGMQPLHTFNTFSGGQNSTNLPHGHSTTRV 



( 
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Figure 2 9 

GCGGAGCATCCGCTGCGGTCCTCGCCGAGACCCCCGCGCGGATTCGCCGGTCCTTCCCGC 

GGGCGCGACAGAGCTGTCCTCGCACCTGGATGGCAGCAGGGGCGCCGGGGTCCTCTCGAC 

GCCAGAGAGAAATCTCATCATCTGTGCAGCCTTCTTAAAGCAAACT 

GATTATCCTTGACXTTTGAAGACCAAAACTAAACTGAAATTTAAA 

AT G TT C rr CGGGGGAGAAGGGAGCTTGACTTACACT 

ATCCAGTCATCTCTTTCTAAGGGAATCAGAGGCAATGAGC^ 

GAAGACTGCATTAATTCTTGCTGTTCAACAAAAAACAT^^ 

TTGATGATCTTCGACACTCGAAAAACAGCTAGACAACCCAACTC 

CCCAACGAGGAAGCCTGTC CATTGAAACCAGCAAAAGGACTTATGAGTTACAGGATAATT 
ACAGATTTTCCATCTTTGACCAGAAATTTGCCAAGCCAAGAGTTACC 
CTCTTACATGGCCAATTTT CACAAGCAGTCACTCCCCTAG CCCATCATCACACAGATTAT 
TCAAAGCCCACCGATATCTCATGGAGAGACACACTT7CTCAGAAGTTTGGA 
CACCTGGAGAAftCTATTTAAGATGGATGAAGCAAGTGCCCAGCTC CTrUClTATAAGGAA 
AAAGGCCATTCTCAGAGTTCACAATTTTCCTCTGATCAAGAAATAGCTCATCT 
GAAAATGTGAGTGCGCTCCCAGCTACGGTGGCAGTTGCTTCTCCACATACCACCTCGGCT 
ACTCCAAAGCCCGCCACCCTTCTACCCACCTATGCTT^ 

CAG CCACAGCTGGC CACCACAG CTCCACCTGTAACCACTGTCACTTCTCAGCCTCCCACG 

ACCCTCATTTCTACAGTTTTTACACGGGCTGCGGCTACACTOT 

GCAGTTCTGACTACCACCTTTCAGGCACCTACGGA 

CCGTTTACAGAAATCTCCAACTTAACTTTGAAC 

CTTTCTATGTCAAATGTGGAGTCTTCCACTATGAATAAAACTGCT^ 

GAGGCCAGTCCAGGCAGTTCCTCCCAGGGCAGTGTTCCAGAAAATCAGTACGGCCTTCCA 

TTTGAAAAATGGCTTCTTATCGGGTCCCTGCTCTTTGGTGTC 

CTCGT CCTCCTGGGTAGAATCCTTTCGGAATCACTCCGCAGGAAACGTTA 

CTCAAGACTGGATTATTTGATCAATGGGATCTATGTGGA .. 

GTCTCTTAATTCATTTAGTAACCAGAAGCCCAAATGCAATGAGTTTCT 

GTCTTAGCAGGAGGTTGTATTTTGAAGACAGGAAAATGCCCCCTTCTC VI TTT 

TTTTTGGAGACAGAGTCTTGCrrCTGTTGCCCAGGCTGGAGTGCA 

TCTCACCGCAACCTCCGTCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTAAGTA 

TCTGGGATTACAGGCATGTGCCACCACACCTGGGTGATTTTO 

GGGTTTCACCATGTTGGTCAGGCTGGTCTCAAACTCCTGACCTAG 

ATGTTTGGTTTTTGAGAAGGAATGAAGTGGGAACCAAATTAGGTAATTTO 
TCTCTAAAATATTAGCTAAAAACAAAGCTCTATGTAAAGTAATAAAGTATAATTGCCATA 
TAAATTTCAAAATTCAACTGGCTTTTATGCAAAGAAACAG^ 
ATTCATTCACATTCTTGGTTCCAGATAAAATCAACTGTT^ 

TGCTTTTCTTTITATATGGATTCCTTTAAAACTTATTCCAGATG TAGTTCCTTCCAATTA 
AATATTTGAATAAATCTTTTGTTACTCAA 
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ATTTTCTGTGAAGGAACCAACTGATCTCCCCCACC 

TTACCCACAGATAACACATGTTGTTTCTACTTGTAAATGT^ 

TTACAGATAAAAAA 
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Figure 30 

MFFGGEGSLTYTLVT I CFLTLRLSASQNCUCKSLEDWIDIQSSLSKGIRGNCPVYTSTQ 
EDCINSCCSTKNISGDKACNLMI FDTRKTARQPNCYLFPCPNEEACPLKPAKGLMSYRI I 
TDFPSLTTOnjPSQELPQEDSIXHGQFSQAVTPIiAHHHTOTSKPTDISWRDTLSQKFGSSD 
HLEXLFKMDEASAQLLAYKEKGHSQSSQFSSDQEIAHIjLPENVSAL^ PHTTSA 
TPKPATIAPTNASVTPSGTSQPQIATTAPPVTTVTSQP PTTLI STVFTRAAATLQAMATT 
AVLTTTFQAPTDS KGS LET I PFTEI SNLTLIHX3NVYNPTALSMSNVESSTMNKXASWEGR 
EASPGSSSQGSVPENQYGLPFEKWLLIGSLLFGVLFLVIGLVL^ 
DYLINGIYVDI 
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Figure 31 

CCCACGCGTCCGCCCACGCGTCCGCCCACGGGTCCGCCCACGCGTCCGGGCCACCAGAAG 
TTTGAGCCTCTTTGGTAGCAGGAGGCTGGAAGAAAGGA 
ATGGGGATCTTACTGGGCCTGCTACTCCTGGGGCACCTAACAGTGGA^ 
CCCATCCTGGAAGTTCCAGAGAGTGTAACA^^ 

TGCAC CTATGACCCCCTGCAAGGCTACACC CAAGT CTTGGTGAAGTGGCTGGTACAACGT 
GGCTCAGACCCTGTOVCCATCTTT^ 

AAGTACCAGGGCCGCCTGCATGTGAGCCACAAGGTTCCAGGAGATGTATCCCT 

AGCACCCTGGAGATGGATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCT 

GATGGCAACCAAGTCGTGAGAGATAAGATTACTGAGCTCCGTGTCCAGAAA 

TCCAAGCCCACAGTGACAACTGGCAGCGGTTATGGCTTCACGGTGCCCC^ 

ATTAGCCTTCAATGCCAGGCTCGGGGTTCTCCTCCCATCAGTTATATTTGGTATAAGCAA 

CAGACTAATAACCAGGAACCCATCAAAGTAGCAACCCTAACT 

GCGGTGATAGCCGACTCAGGCTCCTATTTCTGCACrGCC^ 

CAGCACAGOSACATTGTGAAGTTTGTGGTCAAAGACTCCTC^ 

ACTGAGGCACCTACAACCATGACATACCCCTTGAAAGCAACATCTACAGTGAAGCAGTCC 

TGGGACTGGACCACTGACATGGATGGCTACCTTGGAGAGACCAGTGC 

AGCCTGCCTGTCTTTGCCATCATCCTCATCATCTCCTTGTGCTCT^ 

ATGG CCTATATCATGCTCTGTCGGAAGACATCCCAACAAGAG CATGT CTACGAAGCAGCC 

AGGTAAGAAAGTCTCTCCTCTTCCATTTTTGACCCCGTCCCT^ 

TGGCAGGAAATGTGGAGGAAGGGGGGTGTGGCACAGAGCCAATCCTAAGGCCGGAGGCCT 
TCAGGGTCAGGACATAGCTCCCTTCCCTCTCTCAGGCACCT^ 

TCTGAACACAAAGGATAATTTAGATCCATCTGCCTTCTGCTTCCAGAATCCCTGGGTGGT 

AGGATCCTGATAATTAATTGGCAAGAATTGAGGCAGAAGGGTGGGAAACCAGGACCACAG 

CCCCAAGTCCCTTCT TA TGGGTGGTGGGCTCTTGGGCCATAGGGCACATGCCAGAGAGGC 

CAACX3ACTCTGGAGAAACCATGAGGGTGGCCATCTTCGCAAGTGGCTGCT 

GCCAACTTCCCAGAATCTGGGCAACAACTACTCTGATGAGCCCTGCATAGGACAGGAGTA 



TTATGAGTTTCTGGCCACTGAGGGCAAAAGTGTCTGTTAAAAATGCCCCATTAGGCCAGG 
ATCTGCTGACATAATTGCCTAGTCAGTCCTTGCCTTCTGCATGGCCTTCTTCCCTGCTAC 




GGGCCCTTCTAGTATCTCTGCCGGGGGCTTCTGGTACTCCTCTCTAAATACCAGAGGGAA 
GATGCCCATAGCACTAGGACTTGGTCATCATGCCTACAGACACTATTCAACTTTGGCATC 
TTGCCACCAGAAGACCCGAGGGAGGCTCAGCTCTGCCAGCTCAGAGGACCAGCTATATCC 
AGGATCATTTCTCTTTCTTCAGGGCCAGACAGCTTTTAATTGAAATTGTTATTTCA 



AATAAATATCTAATCATAACAGC 
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Figure 32 

MGI LLGLLXiLGHLTVDTYGRPI LBVPESVTG PWKGDVNLPCT r ^PLQGYTQVLVICWLVQR 
GSDPVTIFIJIDSSGDHIQQAKYQGI^HVSHKVPGDVSMLSTLEMDDRSHYTCBVTWQTO 
DGNQWRDKI TELRVQKLSVSKPTVTTGSGY GFTVPQGMRI SLQCQARGS PPI S YIWYKQ 
QTNNQEP I KVATLSTIJ-FTCPAVIADSGS YFCTAKGQVGSEQHSDrVKFVVKD^ 
TEAPTTWIYPLKATSTVKQSOTWTTDMDGYLGETSAG PGKS LPVFAI I L 1 1 SLCCMWFT 
MAYIMLCRXTSQQEHVYEAAR 
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Figure 33 

GCGCCGGGAGCCCATCTGCCCCCAGGGGCAC66GGCGCGGGGCCGGCTCCCGCCCGGCAC 
ATGGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGT 
CGGAGGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCG 
GGGATCGGG 

ATGT CCCTCCTCCi"rulCci\.~i iGCT^ 

ACTGAGATCAAGAGAGTGGGAGAGGAAAAGGTCACTTTGCCCTGCGACCATCAACTGG^ 

CTTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGA 

AAAGTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACT^ 

GG CC GAGTGGC LTTTGCTTC CAATTT C CTGG CAGGAGATG C CTCCTTGCAGATTGAACCT 

CTGAAGCCCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTG 

TGGAGCCATGTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAA 

GGAGAGCTGA CAGAAGGAAGTGACCTGACTITGCAG TCTGAGTCATC CTCTGG CACAGAG 

CCCATTGTGTATTACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCT 

CCCAAATCTAGGATTGACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATG 

TCCTACTCTGGACTGTACCAGTGCACAGCAGGCAACGAAGC^ 

GTGCGAGTAACTGTACAGTATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGC 
ATAGTGGCTGGAGCCCTGCTGATTTTCCTCriWTGTGG 

AAAGAAAGATATGAGGAAGAAGAGAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCA 

AAAGCCCGTCTTGTGAAACCCAGCTCCTCTTCCTCAGGCTCTCGGAGCTCACGCTCTGGT 

TCTTCCTCCACTCGCTCCACAGCAAATAGTGCCTCACGCAGCCAGCGGACACTGTCAACT 

GACGCAGCACCCCAG CCAGGGCTGGCCAC CCAGGCATACAGCCTAGTGGGGCCAGAGGTG 

AGAGGTTCTGAACCAAAGAAAGTCCACCATGCTAATCTGACCAAAGCAGAAACCACACCC 

AGCATGATCCCCAGC CAGAGCAGAGCCTTCCAAACGGT CTGAA TTACAATGGACTTGACT 

C C CACG CTTTC CTAGGAGTCAGGGTLTri'GGACTCIT LTCG TCATTGGAG CT CAAGTCAC 

CAGCaCACAACCAGATGAGAGGTCATCTAAGTAGCAGTGAGCAT^ 

CAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAAAGGATGTAAGCT 

GTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGGAAAGCAGGAGT 

ATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTGAGGTGAATA 

Aeu"i"X"l AATGTGGGATATTTTG TAT CAGTGu~«TJrJ."GATTCACAAT CAAGAGGAAATGG 

GATGCTGTTTGTAAATTTTCTATGCATTTCTG CAAACTTATTGGATTATTAGTTATTCAG 

ACAGTCAAG CAGAACC CACAGC CTTATTACAC CTGTCTACACCATGTACTGAGCTAACCA 

CATAAGG TTTGGATATTAATTT CAAGGGGAGTTGAAATAGTGGGAGATGGAGAAGAGTGA 

ATGAGTTTCTCCCACTCTATACTAATCTCACTATTTGTATTGAGCCCAAAATAACTATGA 

AAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTO 

AGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAAT^ 

AGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAG 

TTATCAACGTCCTTAGAAAGAATTCTTCTAGA 

TTACCCAACATACCATTATAGTCTCTTCITTCTGAGAAAATC 

ACTGGGTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAAT^ 

CCGGGCATGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGA 
GATTATGCCATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
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Figure 34 

MSUXLIXLVSYYVCTLGTHTEI^ 
KVVTTYSSRHVYN^ 

WSHVILKVLVRPSKPKCELEGELTEGSDLTLQCESSSCTEPrVYYWQRIREKEGEDERLP 
P1«IUDYNHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCWRVTVQYVQSIGMVAGAVTG 
IVAGALUni-VWlJJRRKDKERYEEEERPNElREDAEAPKARLVKPSSSSSGSRSSRSG 
SSSTRSTANSASRSQRTLSTDAAPQPGLATQAYSLVGPEVRGSEPKKVHHANLTKAETTP 

SMIPSQSRAFQTV 
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Figure 35 

CACGCACTTCACCTGGCTCGGGATTCT 

GGGCGGGTGAGGACGGGGACGGGGCGTGTCCAACTGGCTGTGGGCTCTTGAAACCCGAGC 

&TGG CACAGCACGGGGCGATGGGCGCGTTTCGGGCCCTGTGCGGCCTGGCGCTGCTGTGC 

GCGCTCJ^CCTGGGTCAGCGCCCCACCGGGGGTCCCGGGTGCGGCCCTGGGCGCCTCCTG 

CTTGGGACGGGAACGGACGCGCGCTGCTGCCGGGTTCACACGACGCX3CTGCTGCCGCG 

TACCCGGGCGAGGAGTGCTGTTCCGAGTGGGACTGCATGTGTGTCCAGCCTGJ^ 

TGCGGAGACCCTTGCTGCACGACCTGCCGGCACCACCCTTGTCCCCCAGGCCAGGGGGTA 

CAGTCCCAGGGGAAATTCAGTTTTGGCTTCCAGTGTATCGACTGTC 

TCCGGGGGCCACGAAGGC^CTGCAAACCTTGGACAGACTGCACCCAGTTCGGGTTTCT 

ACTGTGTTCCCTGGGAACAAGACCCACAACGCTGTGTGCGTCCCAGGGTCCCCGCCGGCA 

GAGCOTCTTGGGTGGCTGACCGTCGTCCTCCTGGCCGTGGCCGCCT 

ACCTCGGCCCAGCTTGGACTGCACATCTGGCAGCTGAGGAGTCAGTC 

GAGACCCAGCTGCTGCTGGAGGTGCCG CCGTCGAC CGAAGACGCCAGAAGCTGC CAGTTC 

CCCGAGGAAGAGCGGGGCGAG^CGATCGGCAGAGGAGAAGGGGCGGCTGGGAGACCTGTGG 

GTGTGAGCCTGGCCGTCCTCCGGGGCCACCGACCGCAGCCAGCCCCTCCCCAGGAGCTCC 

C CAGGCCGCAGGGGCTCTGCGTTCTGCTCTGGGCCGGGCCCTGCTCCCCTGGCAGCAGAA 

GTGGGTGCAGGAAGGTGGCAGTGACCAGCGCCCTGGACCATGCAGTTC 
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Figure 36 

MAQHGAMGAFRALCGIAIXCALSLGQR 

YPGBECCSBWDCMCVQPBFHCGDPCCTTCRHHPCPPGQGVQSQGKFS FGFQCI DCASGTF 
S GGHEGHCKPWTDCTQFGFIjTVFPGNKTHNAVCVPG S P PAEPIX3WLTVVLLAVAACVLLL 
TSAQLGLHIWQ1J*SQCMWPRETQLI*LEVPPSTEEARS^ 
V 
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Figure 37 

GAAAGCTATAGGCTACCCATTCAGCTCCCCTGTCAGAGACTC^ 

GCAAAGAGCAAGGAAAGAGAGAAAACAACAAAGTGGCGAGGCCCT 

AGGTTCAGTCAGCCTGCTGCAGCITTGCAGACCT^ 

AAGGAAGAGCCTTCCTCACCCAAACCCACAAAAG 

ATGCTGAAAAAGCCTCTCTCAGCTGTGACCTGGCTCTGCATTT^ 

AGCCACCCAGCGTGGCTGCAGAAGCTCTCTAAGCAC^GAC^ 

AAAGCGGCCAACTGCTGTGAGGAGGTGAAGGAGCTCT^GGCCCAAGTTGCCAACCTTAGC 

AGCCTGCTGAGTGAACTGAACAAGAAGCAGGAGAGGGACTGGGTCA^ 

GTGATGGAGCTGGAGAGCAACAGCAAGCGCATGGAGTCGCGGCTCACAGATGCTGAGAGC 

AAGTACTCCGAGATGAACAACCAAATTGACATCATGCAGCTGCAGGCA 

ACTCAGACCTCCGCAGATGCaTCTACGACTGCTCITCCCTCTACC^ 

ATCTCTGGAGTGTATAAGCTTCCTCCT^ 

TTCTGTGACATGGAGACTTCAGGCGGAGGCTGGACCATCA 

CTTCTCTCCTTCTACCGGGACTGGAAGCAGTACAAGCAGGGCTTTGGCAGCATCCGTGOT 
GACTTCTGGCTGGGGAACGAACACATCCACCGGCTCTCCAGAC^ 

GTAGAGATGGAGGACTGGGAGGGCAACCTG CGCTACG CTGAGTATAGCCACTTTGTTTTG 
GGCAATGAACTCAACAGCTATCGCCTCTTCCTGGte^ 

ACGCCCTCCAGTATCATAACAACACZAGCCTTCAGCACCAAGGACAAGGA^ 

GCTTGGACAAGTGTGCACAGCTCCGCAAAGGTGGCTACTGGT^ 

CCAACCTCAATGGAGTGTACTACCGCCTOGGTGAGCACAA 

CCTGGTATGGCTGGCATGGATCTACCTACTCCCTCAAACGGGTGGAGATGAAAATCCGCC 

CAG AAGACTT CAAG CCTTAAAAGGAGGCTGC CGTGGAGCACG G ATACAG AAACTGAG ACA 

CGTGGAGACTGGATGAGGGGAGATGAGGACAGGAAGAGAGTGTTAGAA 

AGGGTAGGACTGAGAAACAGCCTATAATCTC CAAAGAAAGAATAAGTCTCCAAGGAGCAC 

AAAAAAATCATATGTACCAAGGATGTTACAGTAAACAGGATGAACTATTTAAACCCACT^ 

GGTCCTGCCACATCCTTCTCAAGGTGGTAGACTGAGTGGGGTCTCrCTGCCCM 

TGACATAGCAGTAGCTTGTCTTTTCCACATGATTTC 

ATCGTTTTATCTATTTTCTCTACGGCTTAGGCTATC 

GCTAAAAAGAACCATATTATTTTGATTCTCAAAGGATAGGCCTTT^ 

GAGTGAAGGAGGCAGGTGGGAAATGGTATTTCTATTTTTAAATCCAGTGAAATTATCTTG 

AGTCTACACATTATTTTTAAAACACAAAAAl'TGrrCGGCTGGAACTGACC 

TTGCGGGGAGGAAACTCCAGGGCACTGCATCTGGCGATCAGACTCTGAGCACTGCCCCTG 

CTCGCCTTGGTCATGTACAGCACTGLAAAGGAATGAAGCACCAGCAGGAGGTG 

CTCTC^TGGATGCCGGCAC^AAACTGCCTTAAAATATTCATAGTTAATA 

ATTTTTATTTACTTTGTAAGAAACAAGCTCAAGGAGCTTCCTTTTAAAT^ 

GAAATGGTTGAAAACTGAAGGTAGATGGTGTTATAGTTAATAATAAATGCTCT 

CATCTCACTTTGTAAAAATAAAATATTGTGGTTTTGTTTTA 

CCTTCTACAATAAACACTTTCAAAATGTG 
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Figure 38 



MLKKPLSAVTWLCI F XVAFVSHPAWLQKLSKHKTPAQ PQLKAANCCEEVKELKAQVANLS 
S LLSELNKKQERDWVS WMQVMEL ESNS KRMES RLTDAES KYS EMNNQI D IMQLQAAQTV 
TQTSADAIYDCSSLYQKNYRISCTVYKI^ 

LVS FYRDWKQYKQGFGS I RGDFVnJGNraiHRI^SRQPTRI^VEMEDWEGNLRYAEYSHFVL 
GNBI«NS YRIJ'TjGNYTGNVGNDALfO YHNNTAFS TKDKDNDNCLDKCAQLRKGGYWYNCCTD 
SNLNGVYYRI^EHNKHLDGITWYGWHGSTYSLKRVEmiRPBDFKP 
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Figure 2 9 

GGAAGTCCACGGG<^GCTTGGATGCCAA^ 
ACTCACTGGCATATTTCTGAGGTATCTGTAGAAT^ 
TTACAGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACA 
ATGG AGGCCAG CGGGAAG CTCATTTGCAGACAAAGG CAAGTC CT l p i" ' rTTTTrTrm 

TTGGGCTTATCTCTGGCGGGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAA^ 

GAGGGCAGCTCCTTTGTCACCAATTTAGCAAAGGACCTG 

TCCAGGCGGGGGGTTAGGGTTGTTTCCAGAGGGAACAAACTA^ 

GAGACCGCGGATTTGTTCCTAAATGAGAAATTGGACCGTGAGGATCTGTGCGGTCACACA 
GAGCC CTGTGTGCTACGTTTCCAAGTGTTG CTAGAGAGTCCCTTCGAGTTTTTTCAAGCT 
GAGCTGCAAGTAATAGACATAAACGACCACTCTCCAGTATTTC 

GTGAAAG TATCAGAGAGCAGTCCTCCTGGGACTACGTTTCCTCTGAAGAATG CCGAAGAC 

TTAGATGTAGGC CAAAACAATATTGAGAACTATATAATCAGCCCCAACTCCTATTTTC^ 

GTCCTCACCCGCAAACGCAGTGATGGCAGGAAATACCCAGAGCTGGTGCTGGACAAAGCG 

CTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACACTCACAGCACTGGATGGTGGCTCT 

CCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGTCCTGGATGTCAACGATAATGCC 

CCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTGAGGACAGTCCGGTAGGCTTC 

CTG GTTGT GAAGGTCTCTGCCACGGATGTAGACACAGGAGTCAACGGAGAGATTTCCT 

TCACTTTTCCAAGCTTCAGAAGAGATTGGCAA^ 

GAAATTGAACTAAAAAAACAACTCGATTTCGAAAAACTT 

GAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCACCGTTCTG^ 

GTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGCCCAATACCTGAGAA^ 

GCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAG 

AAAATTAGTTGCTCCATTCAGGAGGATCTACCCITCCTCCTGAAATCCGCGGAAAACT 

TACACCCTACTAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACATCACT 

ATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGCTG 

ATCGCCGATGTCZAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCCTC 

CGCGAGAACAACAGCCCCGCCCTG CACATCCGCAG CGTCAG CG CTACAGACAGAGACTCA 

GGCACCAACGCCC^GGTC^CCTACTCGCTGCTGCCGCCCCAGGACCCXSCACCT 

ACATCCCTGGTCTCCATC^CGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGAC 

TACGAGGCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCG 

GTGCTGTACCCGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCC 
GAGCCGGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCGGGCCAGAACGCC 
TGGCTGTCGTACCAGCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCAC 
AATGGCGAGGTGCGCACCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTG 
GTGGTGCTGGTCAAGGACAATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTG 
CTCCTGGTGGACGGCTTCTCCCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAG 
GCCCAG GCCG ACTTGCTCACCGTCTACCTCGTGGTGGCGTTGGCCTCGGTGTCTTCGCTC 
TTCCTCTTTTCGGTGCTCCTGTTCGTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCC 
TCGGTGGGTCGCTGCTTGGTGCCCGAGGGCCCCCTTCCAGGGCATCTTGTGGACATGAGC 

ACCAATGAGTTCAAGTTCCTGAAGCCGATTATCCCCAACITCCCTCCCCAGTGCCCT 
AAAGAAATACAAGGAAATTCTACCTTCC CCAATAACTTTGGGTTCAATATTCAGTGA CCA 
TAGTTGACTTTTACATTCCATAGGTATTTTATTTTGTGGCATTTCCA 
TTCCCCCAATTTGTGTGTATGTAATATTGTACGGATTTACTCTTGATTTCT 
TTTCTCCCTTTGTTTTAAAGTGAACATTTACCT^ 
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Figure 40 

MEASGKLI CRQRQVLFS FI^GLSLAGAAEPRS YS WEETEGS3 

S RRGVR WS RGNKLHLQLNQ ETADLLLNEKLDREDLCGKTE P CVLRFQVLLES P FEFFQA 
ELQVTDINDHSFVT1J)KQKLVKVSESSPPGTTFPLKNA£DLI^ 

VLTRKRSDGRKYPELVLDKALDREEEAELRLTLTALDGGS PPR5GTAQVYI EVLDVNDNA 
PEFEQPFYRVQI S EDS PVGFLWKVSATDVDTGVNGEIS YSLFQAS EEI GKTFXINPLTG 
EIEIJCKQLDFEKWSYEVNIEAMAGTFSGKCTVLIQVIDVND 

APETVVALFSVSDI^SGENGKISCSIQEDI^FLLKSAENFTTLLTERPI^RESRAEYNIT 
I TVTDLGT PML I TQLNMTVL I ADVNDNAPAFTQTS YTLFVRENNS PAI*H I RSVSATDRDS 
GTNAQVTYSI^PPQDPHLPLrSLVSINADNGHIuFAIJlSI^YEALOGFQFRVGASDHGSPA 
LSS EALVRVWLDANDNS PFVL YPLQNGSAPCTELVPRAAE PG YLVTKWAVDGDSGQNA 
Wl*SYQUJCATEI^FGWAHNGEVRTARI^ERDAAKERLVVt 
IJWTCFSQPYLPLPEaUU>TQAQADLLTVYLWAI^ 

SVGRCLVPEGPLPGHLVDMSGTRTLSQSYQYEVCIAGGSGTOEFKFLKPI I PNFPPQCPG 
KE I QGNSTFPNNFG FN I Q 
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Fi cure 4 1 



GCTCCCAGCCAAGAACCTCGGGGCCGCTGCGCGGTGGGGAGGAGTTCCCCGAAACCCGGC 
CGCTAAGCGAGGCCTCCTCCTCCCGCAGATCCGAACGGCCTGGGCGGGGTCACCCCGGCT 
GGGACAAGAAGCCGCCGCCTGCCTGCCCGGGCCCGGGGAGGGGGCTGGGGCTGGGGCCGG 
AGGCGGGGTGTGAGTGGGTGTGTGCGGGGGGCGGAGGCTTGATGCAATCCCGATAAGAAA 
TGCTCGGGTGTCTTGGGCACCTACCCGTGGGGCCCGTAAGGCGCTACTATATAAGGCTGC 
CGGCCCGGAGCCGCCGCGCCGTCAGAGCAGGAGCGCTGCGTCCAGGATCTAGGGCCACGA 
CCATCCCAACCCGGCACTCACAGCCCCGCAGCGCATCCCGGTCGCCGCCCAGCCTCCCGC 
ACCCCCATCGCCGGAGCTGCGCCGAGAGCCCCAGGGAGGTGCC 

GCCGGGCGCCCCCTCGCCTTCTCGGACGCGGGGCCCCACGTGCACTACGGCTGGGGCGAC 
CCCATCOKrCTGCGGCACCTGrrACACCTC 

CGCATCCGTGCCGACGGCGTCGTGGACTGCGCGCGGGGCCAGAGCGCGCACAGTTTGCTG 
GAGATCAAGGCAGTCGCTCTGCGGACCGTGGCCATCAAGGGCGTGCACAGCGTGCGGTAC 
C TCTG CATGGGCGCCGACGGCAAGATGCAGGGGCTGCTTCAGTACTCGGAGGAAGACrrGT 

GCTTTCGAGGAGGAGATCCGCCCAGATGGCTACAATGTGTACCGATCCGAGAAGCACCGC ( 

CTCCCGGTCTCCCTGAGCAGTGCCAAACAGCGGCAGCTGTACAAGAACAGAGGCTTTCTT 

CCACTCT CT CATTTCCTG CC CATGCTG CC CATGGTCC CAGAGGAGCCTGAGG ACCTCAGG 

GG CCACTTG G AA TCTG ACATGT7 CT CTTCG C CCC TGG AG AC CGACAG CATGGAC CCATTT 

GGGCTTGTCACCGGACTGGAGGCCGTGAGGAGTCCCAGCTTTGAGAA GTAA CTGAGACCA 

TGCCCGGGCCTCTTCACTGCTGCCAGGGGCTGTGGTACCTGCAGCGTGGGGGACGTGCTT 

CTACAAGAACAGT CCTGAG TCCACGTTCTGTTTAGCTTTAGGAAGAAA 

GTACATATTCAGAGTTTTCCATTGGCAGTGCCAGTTT 

ATAACATTGTAAGCCT 

GTAGCTTGCCCAGCTGCTGCCTGGGCCCCCATTCTGCTCCCTCGAGGTTGCTGGACAAGC 
TGCTGCACTCTCTCAGTTCTGCTTGAATACCTCCATCGATGGGGAACTC^ 
AAAAATTCTTATGTCAAGCTGAAATTCTCTAATTTTTTCTCA^ 
CCAGAAGACAGGCAGTAGTTTTAATTTCAGGAACAGGTGA^^ 

GTAAATTTCACTCAACCCCATG TGGGAATTGATCTATAT CTCTACTTCCAGGGA C CATTT 

GCCCTTCCCAAATCCCTCCAGGCCAGAACTGACTGGAGCAGGCATGGCCCACCAGGCTTC 

AGGAGTAGGGGAAGCCTGGAGCCCCACTCCAGCCCTGGGACAACTTGAGAArrCCCCCTG 

AGGCCAGTTCTGTCATGGATGCTGTCCTGAGAATAACTTGCTGTCCCGGTGTCACCTGCT 

TCCATCTCCCAGCCCACCAGCCCTCTGCCCACCT 

TCCCAGGCCCCCCACCTTATGTCAACCTC^ 

GATTTGAAGACCCCAAGTCTIX3TCAATAACTTGCTGTC 

AGAACCCT1TCCCCAGCACTTGGTTTTCCAACATGATATTTATGA 

ATGTACATCTCTTATTTTCTTACATTATTTATGCCC c CAAATTATATTTATGTATGTAAG 
TGAGGTTTG TTTTG TATATTAAAATG GAGTTTGTTTGT 
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Figure 42 

MRSGCWVHVWI LAGLWLAVAGRP LAFSD AG PHVHYGWGDP I RLRHL VTSGPHGLSSCF 
LR I RADGWDCARGQSAHSLLE I KAVALRTVA I KGVKSVR YLCMGADG1Q4QGLL.QYSEE 

dcafeeeirpdgynvyrsekhrlpvsi^saxqrqlyx^ 

DLRGKLESDMFSSPLETDSMDPFGLVTGLEAVRSPSFEK 
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Figure 43 

GGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTAACCTC 

CACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGG^ 

GGTGACTTCCAAGAGTCACTCCGTCGGAGGAAA 

AT^CTCCCCAGTCGCTGCTGCAGACGACACTGTTCCTGCT 

CAAGGTGCCCACGGCAGGGGCCACAGGGAAGACTTTCGCT^ 

ACACACAGGAGCAGCCTCCACTACAAACCCACACCAGAC^ 

TCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGATCC 

TTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGA 

TTACATCTTCTCTATGGCAAGCGTGACTTCT7GCTGAGTGACAAAGCCTCT 

TGCTTCCAGCACCAGGAGGAGAGCCTGGCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTC 

ACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCC 

TTCCACAGTCCTCCCCACACGGCCGCTC^CAATGCCTCGGTGGACATGTGCGAGCTCAA^ 

AGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCC 

TCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGA 

TTCATGGGGGACATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTGTGGAAGCTC 

CAGCCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGC 

GAGATCATGGAGTACrCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGG 

AGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAG 

GACAAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACC 

AAAGTAGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAG 

AATGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCAT 

TGGAGCAGTGCTGGGTGTGAGACCX^TCAGGAGAGAAACCCAAACATCCTC 

CACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACA^ 

CACTACCTGAGCCTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTC 

ACCATTGCCGCCTACCTCTGCTCCAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGAC 

TACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTGGAC^CGAGCTTC 

CTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATC 

TTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTC 

TACCGACTCGTGGTGGAGGTCTITGGCACCTATGTCCCTGGCTACCTACTCAAG 

GCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGAC 

AACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAG 

ATGTGC TGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCT 

GTGTTTCTGTTCAACATGGCCATGCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGC 

CCCCACACCCAAAAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGC 

CTGC CCTG GGCCTTGATCTTCTTCTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTC 

TACCTT1TCAGCATCATCACCTCCTTCCAAGGCTTCCTCATCTTCATCTGGTACTGCT 

ATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGG 

CTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCATCTAGGCCTCCAGCCCACCTGCC 

CATGTGATGAAGCAGAGATGCGGCCTCGTCGCACACTGCCTGTGGCCCCCGAGCCAGJGCC 

CAGCCCCAGGCCAGTCAGCCGCAGACTTTGGAAAGCCCAACGACCATGGAGAGATGGGCC 

GTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTGGCCTTGGGGACTACTCGG 

CTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCT 

CCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCTCCTTACAACCCCTGGGCCCAGCC 

CTCATTGCTGGGGGCCAGGCCTTGGATCTTGAGGGTCTGGCACATCCTTAATCCTGTG 

CCTGCCTGGGACAGAAATGTGGCTCCAGTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCA 

CTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGGGCA 

GACCTTCAGGGCCAGAGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGC 

TCGCCTACCTCTGAGCCCAGGCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCC^TCT 

TCCCTGGGGTTCTCC TCCTCTC CCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTC 

CCCGATTCCAATGCTGTTTTTTGGGGAGTGGT^ 

AATGTTTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCAGGCTCGGTACCGATGCGT 
GGGCTGGGCTAGGTCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTC 
ACCCTGACCAAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGC 
AAGAACTGTGGACCATGCCAGTC CCGTCTGGTTTCCATCC CACCACTCCAAGGACTGAGA 
CTGACCTCCTCTGGTGACACTGGCCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAA 
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GCCCCCAAATAGCTCCAGGCGCCCTCGGCCGCCCATCATGGT^ 

ACACACGGGTAGATTGCTGGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTC 
ACTCCTCCTGCCJ^CATTCAGTCTGGTATCTGAGGCGTGCCT 

GCTACAGGQACAGGGAGCCATCATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAG 
TCAGCGTTCAATCTTGACCTTGAAGATGGGAAGGAT G 1 1 CTTTTTACGTACCAATTCTTT 
TGTCTTTTGATATTAAAAAGAAGTACATGTTCATTGTAGAGAATTTGGAAA 
AGAATCAAGAAGAAAAATAAAAATCAGCTGTTGTAATCGCCTAGCAAAAAAAAAAAAAAA 

AAAAAA 
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Figure 44 

MTPQSIJjQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRS^ 

SEEALTVHAPFPAAHPASRSFPDPRGLYHFCLYWin^GRIJILLY^ 

CFOHQEESLACy3PPlJ^TSVTSWWSPQNISLPSAASFTFSraSPPHTAAHNASVDMC&K 

RDLQLLSQFLKHPQKASRRPSAAPASQQLQSLSSKLTSVRFMGDMVSFEEDRINATWKL 

QPTAGWDLHIHSRQEEEQSEIMF^SVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQ 

DKNSSQVLGEKVLGIWQNTKVANLTEPWLTFQHQI^ 

WSSAGCETVRRETQTSCFCHHLTYFAVLMVSSVETVDAVHKOT 

TIAAYLCSRVPLPCRRKPRDYTIKVHMNLLIAVT^ 

FLHFS IXTCLSWMGLEGYNLYRLVVEVFGTYVPGYLLKLSAMGWGFP I FLVTLVALVDVD 
NYGPIIIJVVHRTPEGVIYPSMCTIRDSLVSYITNLGLFSLVTLFNMAMIATMW 
PHTQKWSHVIiTLLGLSLVLGLPWALI FFS FASGTFQLWLYLFS I ITSFQGFLIFIWYWS 
MRLQARGGPSPLKSNSDSARLPISSGSTSSSRI 
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Figure 45 

GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAG 
GTGGGGCGCCGCTGGGGCCGGCCCGCA'CGGGCTTCATCTGAGGGCGCACGGCCCGCGArc 
GAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCA 
ATGGGCCGCGGCTGGGGATTCTTGTTTGGC^ 

GGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGT 

GGTTACTTGGATGATTGTACXrrGTGATGrrGAAAC 

CTTTTCCCAAGACTACAAAAACTTCTT^^ 

CTGAAGAGGCCGTGTCCTTTCTGGAATGACATCAGCCAGTGTGGAAGAAGGGACT 
GTCAAACCATGTCAATCTGATGAAGTTCCTGATGGAATTAAATCTG 
TCTGAAGAAGC CAATAATCTCATTGAAGAATGTGAACAAGCTGAACGACTTGGA 
GATGAATCTCTGAGTGAGGAAACACAGAAGG CTGTTCTTCAGTGGACCAAGCATGATGAT 
TCTTCAGATAACTTCTGTGAAGCTGATGAC^TTCAGTCC^ 
TTGCITCTTAATCCI^GCGCTACACTC 
AATGTCATCTACGAAGtfU^CTGTTTTAAGCCA^ 
TTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACTTTT^ 
TGTGTAGAAAAAAGAGCATTCTACAGACTTATAT CTGGCCT^ 

CA I L i I\jAG TG CAAGATAT CTTTTACAAGAGAC CTGGTTAGAAAAGAAATGGGGACACAAC 

ATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGA^ 

CTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTT^ 

TTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTCA 

AAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTC 

AATTCA1 r 1 TTl^CTGGGGATAAAAAAGAAG CACACAAACTAAAGGAGGACTTTCGACTG 

CATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAA 

GG^AAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGftTCTTAT T TT C LX jA GAAATTG 

ATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGAAATA 

GTATCATTATTCAACGCATTTGGAAGAATTTCTACA^ 

AGGAACTTGTTACAGAATATTCATTAAAGAAAACAAGCTGATATGTC 

AATGGAGGCGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAG^ 

AAACATTTTATATAAAGTTGCTTTTGTAAAGGAG 

TTTTAAAAATTGTGTTAAGTCTATGTATAATACTACTGTGAGTAA 

AATGTGGTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATT^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 46 

MGROTGFLFCTXGAVWI^SGHG^ 
LFPRI^KI^ESDYFRYYKVNLKKPCPFWND^ 

SEEANNLI EECEQAERLGAVDESLS EETQKAVLQWTKHDDSSDNFCEADDIQSP EABYVD 
LLLNPER YTGYKGPDAWKIWNVI YE EN CFKPQTI KRPLNPLASGQGTS EENTFYS WLEGL 
CVEKRAFYRLISGLHASINVHLSARYLLOETWI^EKKWGHNITEFQORFDGILTEGEGP^ 
LKNL YFL YL I ELRALS KVT.P FFERPDFQLFTGNKI QDEENKMLLL EI LHE I KSFPLHFDE 
NS FFAGDKKEAHKLKEDFRLHFRNI SR IMDCVGCFKCRLWGKLQTQGLGTALKI LFSEKL 
IANMPESGPSYEFHLTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 
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Figure 47 

GCCACCTTGTCTTCTTTCC^ 

ATCTTTTCTCCCAGCCCCAGGACACTGACTCTGTACAGG 

ATGGGGCCGTC l TCT TGC CTC CTT CTCATCCTAATCCCCCTTCTCCAGCTGATCAACCCG 
GGGAGTACTCACTGTTCCTTAC^CTC^ 

AGTCTAGAGTACAGTCCCTCTCCTATAAGCAAGAAGCTCTCGTGTGCTAGTGTCAAAAGC 

CAAGGCAGACCGTCCTCCTGCCCTGCTGGGATGGCTGTCACTGGCTGTGCTTGTG 

GGCTGTCGTTCGTGGGATGTTCAGCTGGAAACCACCTGCCACTG 

GACTGGACCACTGCCCGCTGCTGCC^CCTGACCTGACAGGGA 

TTTGTGACCATGACAGTAATGLAAACCAGGGTCCCAACCAA 

C^CTTCATTTGTTCCATTCCTGAT^^ 

AAAAAAAAAAAAAAAA 
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Figure 48 

QGRPSSCPAGMAOTGCACGYKGSWD^LETTTCHCOCSVVtiWTTARCCHLT 



( 



(' 
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Figure 4 5 

GGCACGAGGGGGACAGGAGCTAATACCCAGAACTGAGTTGT^ 
CACGTACCCACGGG 

ATGAAGAACCTTTCATTTCCCCTCCTTTTCCTTTTCTTCCTTGTCCCT 

TCCAGCATGCCACTGTGTCCCATCGATGAAGCCATCGACAAG^ 

AACTCCCTGTTTCCAAATGCAATAAAGAACATTGGCT 

AGAGGGAAGTTGGCCTCCTGCCCAGAAGGCACAGCAGTCTTGAGCTGCTCCTGTGGCTCT 
GCCrGTGGCTCGTGGGACATTCGTGAAGAAAAAGTGTCTC^ 

GACTGGACAGCAGCCCGCTGCTGTAAGCTGCAGGTCGCTTC CTGA TGTCGGGGAAGTGAG 
CGTGGTTTCCAGCACAGCCAC CCGTTCCTGTAGCTCCAGAGATGTCTGATGTCCTC CGGT 
CTCTACAGGCACCTGCACTCACGTGCGCGAATCCACACACAAGCA.C^CATACTTAA 
AAAACAAAACAGGCTGGAAAAAAAAAAAAAAAAAAAAAAA 
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Figure SO 

MKNl^F^IXFLFFLVPELLGSSMPLCPIDEAIDKKIKQDFNSLFPNAIKNIGIJ^CWTVS 
SRGKLASCPEGTAVLSCSCGSACGSWDIREEKVCKCQCARIDWTAARCCJaQVAS 
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Figure 51 

CCAGTCTCTCGCCACCTCACTTGGTCTCTGCTG 
GCACAGAGGAGTGGGCCGGGACC 

ATGCGGGGGACGCGGCTCGCGCTCCTGGCGCTGGTGCTGGCTG CCTGCGGAGAGCTGGCG 
CCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGTCGGACTGTGTCACCATC 
GCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTACTCCCGGGAGATAGTG 
TACCCC7TCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAAGTGTAAGCCCTCG 
GATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGC^ 

GTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTCCTCCCACTC 
TTGAGCCTCCGACTG^GAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCAGCCCOSA 
ATG CCTTGAAG AAGTGC CCCCTG CAC CAGGAAAAAAAAAAAAAAAAA 
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Figure 52 

MRGTRIJO^VI^CGElAPAI^Cr^CPSPTGVSDCW 

YPFQGD STVTKSCASKCKPS DVDG I GQTL PVS CCNTELCNVDGAPALNSLHCGALTLLPL 
LSLRL w 



i 



54 / 133 



(0 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/US00/05841 



Figure 53 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACT 

TGGGCAGJ^GGAGGbT G CTTCGG&GCCCGCCC m^ ^ 

AACAATTGAGGCTTCGCTGCGACTCAGACCTCAGCT^ 

ATGGCTGAGATGGACAGAATGCTTTATTTTGGAAAGA^ 

GTCTACCAA 

ATCCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATG TGGTTTTTC 

TACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATT^ 

CTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGAT 

GGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGC 

CACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACT 

GATGACATCACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATT^ 

ACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGA 

CCTGGGATGGAGATCACCAAAGATGGCTTCCACCTGGTTATT^ 

CCC CAG TTTGAGTTC CTTGTGG CCTACTGGAGGAGGGAGC CTGGTG CCGAGG AACATGTC 

AAAATGGTGAGGAGTGGGGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCA 

TACTGTGTGAAGGCCCAGACATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAG 

ACAGAATGTGTGGAGGTGCAAGGAGAGGCCATTCCCCTGGTACTGGCCCTGT^ 

GTTGGCTTCATGCTGATCCTTGTGGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTG 

CTCCAGTACTCCrrGTTGCCCCGTGGTGGTCCTCCCAGACACCTTGAAAATAACCAATT 

CCCCAGAAGTTAATCAGCTGCAGAAGGGAGGAGGTGGATGCCTGTGCCACGGCTGTGATG 

TCTCCTGAGGAACTCCTCAGGGCCTGGATCTCATAGGTTTGCGGAAGGGCCCAGGTGAAG 

CCGAGAACCTGGTCTGCATGACATGGAAAC 

CATGAGGGGAC^GTTGTGTTTCTGTTTTCCGCOVCGGACAAGGGATGA 

AGAGCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAG 

AACACTGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCA C rTGCTGGCTG 

AGCAACCCTGGGAAAAGTGACTTCATCCCTTCGGTCCTAA G w l w i M rrCTCATCTGTAATGGG 

GGAATTACCTACACACCTGCTAAACACACACACACAGAGTCTCTCTCT 

GTACACATAAATACACC CAGCACTTG GAAGGCTAGAGGGAAACTGGTGACACTCTACAGT 

CTGACrGATTCAGTGTTTCTGGAGAGCAGGACATAAATGT^ 

CTCTACACACTGGGTGG CTTGGAGAG CCCACTTTCCCAGAATAATCCTTGAGAGAAAAGG 

AATCATGGGAGGAATGGTGTTGAGTTCACTTCAAGCCCAATG CCGGTGCAGAGGGGAATG 

GCTTAGCGAGCTCTACAGTAGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGA 

TGTGCATGAACACGGAGGATCCATGAACTACTGTAAAGTG7TGACAGTGTGTG CACACTG 

CAGACAGCAGGTGAAATGTATGTGTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGT 

GCATGriTG T T U TGCTC ClTrrri 'CTGTT^ 

GCCACCCTGGCCAAAAGCGGTAAAAAAAAAAAAAAAA 



55 / 133 



(0 2003 Copyright Derwent Information Ltd. 



W ° 00/53758 PCT/US00/0S84I 



Figure 54 

MQTFTMVLEE I WTSLFMWFF YAL I PCIJjTDEVAIL FAPQNLSVLSTNMKHLLMWS PVTAP 

GETVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITAW 

TSAWS ILKHPFNRNSTILTRPGMEITKDGFHLVI ELEDLGPQFEFLVAYWRREPGAEEHV 

K^fVRSGGIPVHLETMEPGAAYCVKAOTFVKAIGRYSAFSQTECVEVQGEAI PLVLALFAF 

VGFMLILVWPLFVWKMGRIJLOYSCCPVVVLPDTIJCITNSPQKLI^ 

SPEELLRAWIS 
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Figure 55 

GAGCAGGACGGAGCC 

ATGG ACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGCAGGCTGGCTGCTG 

CTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCGTGCA 

GATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAA^^ 

TGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACA^ 

GGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGC 

GCGTTCATCCAGCTGCAGCAATGCGCTCAGGArCGCTGC^ 

TCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTGC 

TACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGC 

TGCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAAC 

GCAGCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCT G TGTCCAGGATGAATTCTGCACT 

CGGGATGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTC C T G TT G CCAGGGGTCCCGC 

TGTAACTCTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCA 

CTGCCCCCTCCAGAGCCCACGACTGTGGCCTCAACC^CATCTCT 

GCCCCAGTCAGACCCACATCCACCACCAAACCCATGCCM 

AGACAGGGAGTAGAA CACGAGGC CT C C CGG G ATGAGG AGC CCAGG TTGACTGGAGGCG CC 
GCTGGCCACCAGGACCGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGGCCCGAGCAG 
CCCCATAATAAAGGCTGTGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCC^ 
GCTGCTGGTGTCCTACTCTGAGCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCT 



GCCCCTGTTTTTCCAACATTCCCCAGTATCCCCAGCTTCTC 

TGGGAAATAAAATACCGTTGTATATATTCTGCCAGGGGTGTTCTAGCTTTTTGAGG^ 

CTCCTGTATCCTTCTCATCC1TGTCTCTCCGCTTGTCCTCTTGTGATGTTO 

GAGAGAAGTCAGCTGTCACGGGGAAGGTGAGAGAGAGGATGCTAAGCTTCCTACTCACTT 

TCTCCTAGCCAGCCTGGACTTTGGAGCGTGGGGTGGGTGGGACAATGGCTCCCCACT 

AGCACTGCCTCCCCTACTCCCCGCATCTTTGGGGAATCGGTTCCCCATATGTCTTCCTT^ 

CTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTACCCAATTCGCCCTATAGTGAGTCGTA 
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Figure 56 

MDPARKAGAQAMIWTAGWI^IXLIJIGGAQAI^ 
<nXAVGAVETIHGQFSIAVRGCGSGLPGKNDRGI^^ 

S RALDPAGNESAYP PNGVECYSCVGLSREACQGTS PPWSCYNASDHVYKGCFIX3NVTLT 
AAltt/TVSLPVRGCVQDEFCTRDGVTGPGm 

LPPPSPTTVASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGA 
AGHQDRSNSGQYPAKGGPQOPHNKGCVAPTAGIAALLLAVAAGVLL 



t 
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Figure 57 

CGGCCACAGCTGGCATGCTCTGCTTG^^ 

TCGTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAAT 

ATOAACACGTGGCTGCTGTTCCTCCCCCTGTTCCCGGTGCAGGTGCAG^ 

GTGATCATCGGGATGCTCGTGCTCCTGCTC^ 

CTGCTCATCTTCCACATCTACCTGAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAA 
GGCTGGGTGGTCAGAGCTGCTCATCTTACACCTCTACTTGAGTATGTCCCTAACCCT 

CCCCCCACGCCTGGGGCCAGAGTCTTTGTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCC 
TCTCCCAGAAGTGAGATCATGGACAAAAAGGGCAAATCACAGGAAGA^ 

AGGACCCAGCAGGCCCAGCAAGAAGCTGAACTCACGCCGAGACCTGCAGGAGTGGTGCCA 
GGTGCTT^GTAACAAGTTTAAAATGTTCAGAGACAATC^l^ 

GAACAGGAC^TTATGAAATAAGGACAGGTGGACTTCCAAAAACA 

CAATGAAATATATTACAGGCAGGTCACCCACTAACCAAACAACTGAAGCGAGAGCT 
GTCTTGCTTGGTCTCACAGTGGGCACAGCGCTAG^ 

GAGGGTAAACTCC CCAGC CCCAAGAAAACCTGTGTTGGAAGTAACAACAACCTCCCTGCT 

CCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCGTCTTCCATTCT 

CAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGACACGGGCAGCAGAGT 

GTGTCCAGGCCAGCCCCCAAC3AATGCCCTGCTCCTGACAGCTTGGCCAACCCCTGGTCAG 

GGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCA 

CCTGCAGTTGTGGCAAGAACGC CCAGCTCAGAATGAACACACCCCACCAAGAGCCTC CTT 

GTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGATGTTTTA 
AAACA CACACCTCTAACGCATATCra 

TTTTTAATGAAAGTGCAATGAAAATttCT^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 58 

m^LFLPLFPVQVQTLIVVIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPO 



( 
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Figure 59 

AGGCGGGCAGCAGCTGCJU3GCTGACCTTGCAGCTTGGCGGA 

ATGGACTGGCCTCACAACCTGCl^TTTCTTCTrACCATTTCCATCTTC 

CAGCCCAGGAGCCCCAAAAGCAAGAGGAAGGGGCAAGGGCGGCCTGGGCCCCTTC 

GG CCCTCACCAGGTGCCACTGGACCTGGT G TCACGGATGAAACCGTATGCCCGCATGGAG 

GAGTATGAGAGGAACATCGAGGAGATGGTGGC CCAGCT 

CAGAGAAAGTGTGAGGTCAACTTGCAGCTGTGGATGTCCAACAAGAGGAGCCTCT 

TGGGGCTACAGCATCAACCACGACCCCAGCCGTATCCCCGTGGACCTGCCGGAGGCACGG 

TGCCTGTGTCTGGGCTGTGTGAACCCCTTCACCATGCAGGAGGA 

GTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCGCCTCTGCCCGCCACCGCCCCGCACA 
GGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCGCTGTGGGCTGCACCTGCATCTTC 
TC^T.CACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGACCATCCTCCTTGCACC 1 HUT 
GCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAAGCAAG 
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Figure 60 
MDWPHNI£FU,TISIFLGIX3QPRSPKKRXGQG^ 

E YERNI EEMVAQLRNS S EIJ^QRKCEVNI^QL WMSNKRSLS PWGY S INHDPS R I PVDL PEAR 
CLCLGCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 
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Figure 61 

AGATGGTCAACGACCGGTGGAAGACCATGGGCGGCGC^ 

GCGACAAGCCGCAGCGGCCGAGCTGCGGCTACGTGCTGTGCACCGTGCTGCTGGCCCTGG 
CTGTGCTGCTGGCTGTAGCTGTCACC 

GGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCCGGGCACGGCGCCCCCACCrGTCGTC 
AGCACTGGGGCTGCCAGCGCCAACAGCGCCCTGGTCACTC 
CACCTCAGCATCCTCATTGACCCGCGCTGCCCCGACCTC^^ 
GAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCM 

GTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCCCCGGCTGCTG 

GCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATGGCAOT 

GGCCAGGGCCTCAGCGCCCTGCAGAGTGAGCAGGGCCGC^ 

AGCCAGGGCCACATGGCTCACCTGGTGAACTCCGTCAGCGACATCCTGGATG 

AGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCCX5G 

GGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 

CTAAG CGGACAGCAGGACGATGGCGTCTACTCTGTCTTTC CCACCCACTACCCGGCCGG C 

TTCCAGGTGTACTGTGACATGCGCACGGACGGCGGCGGCTGGACGGTGTTTCAGCGCCGG 

GAGGACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGT AC CGAGACGG CTTTGGCAG G 

CTCACCGGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGrc 

TACGAGCTGCACGTGGACCTGGAGGACTTTGAGAATGGCACGGCCTATGCCCGCTACG^ 
AGCTrCGGCGTGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTG 
GCTGACTIATTCCGGCACTGC^VGGCGACTCCCTCCTGAAGCA.CAG 

ACCAAGGACCGTGACAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCC 

TGGTGGTACCGCAACTGCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCAC 

GCCTCCTATGCCGACGGCGTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAG 

TTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGGACCGCTAGACTGGTGCACCTTCT 

TGGCCCTGCTGGTCCCTGTCGCCCCATCCCCGACC^ 

TCCACCCACCTGTGCCTGGCGGACCCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGA 
CACGJ^GCTCCCTGGGCCGGTGAAGTCACACATCGCCTTCTCGCCGTCCCCACCCCCTCC 
ATTTGGCAGCTCACTGATCTCTTGCCTCTGCTG^ 

AACTCCTGCCTGCCCCCACTGTGACTCCGGTGCTGTTTGCCGTCCCCTGGCCAGGATGCT 

GGAGTCTGCCCCAGGCACCCTCTGCCCTGCCCGGCCAAATACCCGGCATTATGGGGACAG 

AGAGCAGGGGGCAGACAGCACCCCTGGAGTCCTCCTAGCAGATC^ 

CTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAGCCCTCCCAATGCCAACCCCCACCC 

CGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 

GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATC 

GGGAGCTGCIXXrCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAG 

CCACCCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGC 

TCTCTGTCTCAAATGAGGCCC^CCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCT 

GGGGCAGCCGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCA 

CCGTGGGGCTGGACTGCGCTAATGGGAAGCTCTTGGTTTT 

GGCTGGGATGAGGCTTGTACAACCCCCACCACCAATTTCCCAG 

GAGGCCAACCCTTGCCATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCOT 
GTGGTCAAGGGACAGGGACCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCA 
GACCAGGCACCACCTGGACACTTTCTTGTTGAATCCT CC CAACACC CAG CACGCTGTCAT 
CCCCACTCCTTGTGTGCACACATGCAGAGGTGAGACCra 
CACAAGGGCAGGX3CTGGAGCCGGGTCCTCAGCTGTCTGCTCAG 

GCGTTACGTCAGGCCCAGATGCAGG GCGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCG 
AAAGGGCTGGAGTCAGCCTTGGGGAGCTGCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAG 
GTGGCTTCCTCCAAAGGACACCCGATGGCAGGTGCCTAGGGGGTtf 

CCTTC CCCTC C CACTGAAGTTTGTG CTTAAAAAACAATAAATTTGACTTGG CACCACTGG 
GGGTTGGTGGGAGAGGCCGTGTGAC CTGGCTCTCTGTCCCAGTGCCACCAGGTCATCCAC 
ATGCGCAG 
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Figure 62 

MVNDRWXTMGGAAQLEDRPRDKPQRPS CGYVLCTVIJJ^AVIXAVAVTGAVLFIiNHAHAP 
GTAPP PVVSTGAASANSALVTVERADSSHLS I LIDPRCPDLTDSFARLESAQASVLQALT 
EHOAQPRLVGDQEOEIXDTlADQLPRIJiARASELQTECMGLRKGHGTLGQGLSAIiQSE^ 
RLIQI^SESQGHMAHLVNSVSDILDAIjQRDRGLGRPRNKADLQRAPARGTRPRGCATGSR 
PRDCUN^SGQQDDGVYSVFPTHYPAGFQVYOSMR^ 

AYRDGFGRLTGEHWKlJailHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPE 

EDGYPLTVADYSGTAGDSIJJCHSGMRFTTKDRDSDHSE1TOCAAFYRGA 

GQYLRGAHASYADGVEWSSWTGWQYSLKFSEMKIRPVREDR 
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Figure 63 
AGTGACTGCAGCCTTCCTAGATCCCCTCCACT^ 

AGCACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTC 

CACCCAGCCCCCGGCACAGAGCTGCTCCACAGGCACC 

ATGAGGATCATGCTGCTATTCACAGCCATCCTGGCCTTCAGCCT 

C CAGATCTCTACCAGCTGCTCCAGAGACTCTTCAAAAGCCACTCATCTCT 

CTCAAAGCCCTGAGCC7VGGCTAGCACAGATCCTAAGGAATCAACATCTCCC 

GACATGCATGACTTCTTTGTGGGACTTATGGGCAAGAGGAGCGT 

ACAGGAC lTl 1 C T TACCTTCAGTGAGGGTTCCTCGGCCCCTTCATCCCAATCAGCTTGGA 

TCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACCTTTATAAGACTCT 

GATGTGAATCAAGACiAACGTCCCCAGCTTTGGCATCCTCAAGTATCC 

TAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTCTTT^ 

CCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTC 

TTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGG 

CC CAACC C CAATGGG CATTGACTGT AGAATAC C CTAGAGTT C CTGTAGTGTCCTACATTA 
AAAATATAATGTCTCTCTCTATTCCTCZAACAATAAAGGATTTTTGCATATGAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 64 

MRIMI*LFTAIIAFSlAQSFGAVCKEPQEEVVPGGGRSKRDPDLyQUiQRLFKSHSSLEa 
LKALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFTjPSVRVPRPLHPNQLG 
STGKSSLGTEEQRPL 
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Figure 65 



GCCTTCTGCCTGC 

ATGGACGCTCTGAAGCCACCCTGTCTCTGGAGGAACCRCGAGCGAGGGAAGAAGGACAGG 

GACTCGTGTGGCAGGAAGAACTCAGAGCCGGGAAGCCCCCAT^ 

GATGCGGCCCCCTCGCAGGGT 

CTGAATTTCCTGCTGCTGTTCACAAAGATGCTTTTTA 

CTTCCGACCCCGGCGTTGATCTGCATCCTGACATTTGGAGCTC 

ATCACCAGACCTCAACCCGTCTTACCTCTTCTTGACCTGAACAATCAGTC 

GAGGGAGGAGCACGGAAGGGGGTTTCCCAGAAGAACAATGACCT^ 

TCAGATGCCAAGACTATGTATGAGGTTTT CCAAAGAGGACTCGCTGTGTCTGACAATGGG 

CCCTC^nTGGGATATAGAAAACCAAACCAGCCCTACAGATGGCT 

TCTGATAGAGCAGAGTACCTGGGTTCCTGTCT^ 

GACCAGTTTGTCGGCATCTTTGCTCAGAATAGGCCAGAGTGGATCATCrc 

TGTTACACGTACTCTATGGTAGCTG TAC CT CTGTATGACA C CTTGGGACCAGAAG CCATC 

GTACATATTGTCAACAAGGCTGATATCGCCATGGTGATCTGTGACACACCCCAAAAGGCA 

TTGGTGCTGATAGGGAATGTAGAGAAAGGCTT CACC CCGAG CCTGAAGGTGATCATCCTT 

ATGGACCCCTTTGATGATGACCTGAAGCAAAGAGGGGAGAAGAGTO 

TCCCTATATGATGCTGAGAACCTAGGCAAAGAGCACTTCAGAAAACCTGTGCCT 

CCAGAAGACCTGAGCGTCATCTGCTTCACCAGTGGGACCACAGGTGACCCCAAAGGAGCC 

ATGATAACCCATCAAAATATTGTTTC^AATGCTC 

GCTTATGAGCCCACTCCTGATGATGTGGCCATATCCTACCT 

GAGAGGATTGTT^CAGGCTGTTGTGTACAGCTGTGGAGCCAGAGTTGGATTCTTC CAAGGG 

GATATTCXSGTTGCTGGCTGACGACATGAAGACTTTGAAGCCCAC^ 

CCTCGACTCCTTAACAGGATCTACGATAAGGTACAAAATGAGGCCAAGACACC 

AAGTTCTTGTTGAAGCTGGCTGTTTCCAG^ 

AGGCATGATAGTTTCTGGGACAAGCTCATCTTTGCAAAGATC 

AGGGTTCGTGTAATTGTCACTGGAGCTGCCCCCATGT^ 

CGGGCAGCAATGGGATGTCAGGTGTATGAAGCTTATGGTCAAACAGAATGC^ 

TGTACATTTACATTACCTGGGGACTGGACATCAGGTC^ 

AATTACGTGAAGCTX3GAAGATGTGGCTGACATGAACTACTTT 

GAGGTCTGCATCAAGGGTACAAACGTGTTCAAAGGATACCTGAAGGACCCT 

CAGGAAGCCCTGGACAGTGATGGCTGGC . 

TTCACACAGGAGACATTGGTCGCTGGCTCCCGAATGGAACT^ 

AAAAGAACATTTTCAAGCTGGCCCAAGGAGAATACATTGCACCAGAGAAGA 

TCTACAACAGGAGTCAACCAGTGTTACAAATTTTTGT^ 

C CTTAGTAGGAGTGGTGGTTCCTGACACAGATG TACTTCCCTCATTTGCAGC CAAG CTTG 

GGGTGAAGGGCTCCTTTGAGGAACTGTGCCAAAACCAAGTTC 

AAGACTTGCAGAAAATTGGGAAAGAAAGTGGCCTTAAAACTTCT 

TTTITCTTCATCCAGAGCC&TTTTCCATTGAAAATGTC 

CAAAGCGAGGAGAGCTTTCCAAATACTTTCGGACCCAAATT^ 

TCCAGGACTAQGATAAGGTACTTAAGTACCTGCCGGCCCACTC 

AATGGATTAAAAACTATTCTTACATTTGTTTTGCCTTTCCTC 

TTAAACTCTAAAGCCATAGCTTTTGTTTTATATTGAGACATATAATG 

CCCAAATAAATCAATCCTGTCTTTCCCATCTTCGATGTTGCTAAT^ 

CTACTTTTATCAACATGCCTGTCTTCAAGATCCCAGTTTATGTT 

GATTTC CAA CCTTAATACT ATTAG TAAC CACAAGTTCAAGGGTCAAAGGG ACCCTCTGTT^ 

CCTTCTTCTTTGTTTTGTGATAAACATAACTTGCCAACAGTCTCTATG 

TTCTACTGTTCAAACTAAGAGATTTTTAAATTCTGAAAAACT 

CTAGCCACTCCACAAACCACTAAAATTTTAG TV 1TAGCCTATCACTCATG TCAATCATAT 

CTATGAGACAAATGTCTCCGATGCTCTTCTGCGTAAATT^^ 

AGTTTGATCATACCAAACATTTCCTAAACTCTCTAGTTAGATATCTGACT^ 

AAAAATTGGGTCTATGACATACTGTCCAAAAGGAATGCTGTTCT^ 

GTAGGAACTGGGGAGTAAATCTGTTCCCTACAGTTTGCTGCTGAGCTC 

GAAGGAGTTGACAGGTGGGCCCAGTGAACTTTTCCAGTAAATGAAGCAAG CACTGAATAA 

AAACCTCCTGAACTGGGAACAAAGATCTACAGGCAAGCAAGATC^ 
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Figure 66 

MDAIJCPPCLWRNHERGKroRDSCGRXN^ 

FNFLFS PLPTPALI CILTFGAAI FLWLI TRPQ PVL PLLDLNNQS VG I EGGARKGVSQKNN 

DLTSCCFSDAKTMYEVFQRGLAVSDNGPCIX?YRKPNQPYRWLSYKQVS 

KGYKSS PDQFVG I FAQNRPEWI I SELACYTYSMVAVPL YDTLGPEAI VHI VNKADI AMVI 

CDTPQECALVLIGNVEKGFTPSLKVTIliMDPFDDDLKORGEKSGIEILSLYDAENLGKBHF 

RKPVP PS PEDLSVI CFTSGTTGDP KGAMITHQNI VSNAAAFLKCVEHAYEPTPDDVAI S Y 

LPLAHMFER I VQAWYSCGARVGFFQGD I RLLADDMKTLKPTL F PA VP RLLNR I YDKVQN 

EAKTPLKKFLLKLAVSSKFKELQKGI I RHDS FWDKLI FAKI QDS LGGRVRVIVTGAAPMS 

TSVMTFFRAAMGCQVYEAYGQTECTGGCTFTLPGDWTSGHVGVPLACNYVKLEOT 

FTVNNEGEVCIKGTNVFKGYLKDPEKTQEALDSDGV^ 

FKIAQGEYIAPEKI ENI YNRSQFVLQ I FVHGES LRSSLVGWVPDTDVLPS FAAKLGVKG 

SFEELCQNQVVREAILEDLQKIGXESCaJCTFEQVK^ 

ELSKYFRTQIDSLYEHIQD 
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Figure 67 

GAAAGA 

ATC TflntSGCTGCTLTlTTTTLT^ 
GAACTCTGTCAACCAGGTGCAGAAAATGCTTTTAAA^ 

CTGGGAGATAAAGCATATGCCTGGGATAC CAATGAAGAATACCTCTTCAAAGCGATGGTA 

GCTTTCTCCATGAGAAAAGTTCCCAACAGAGAAGCAACAGAAATTTCCCATGT 

TGCAATGTAACCCAGAGGGTATCATTCTGGTTTGTGGTTACAG^ 

ACCCTTCCTGCTGTTGAGGTGCAATCAGCCATAAGAATGAAC^ 

GCCTTCTTTCTAAATGACCAAACTCTCGAATTTCT 

CCCATGGACCCATCTGTGCCCATCTGGATTATTATATTTGGTGTGATA 

A1AGTTGCAATTGCACTACTGATTTTATCAGGGATCTGGCAACCT 

GAACCATCTGAAGTGGATGACGCTGAAGATAAGTGTGAAAACATGATCAC^ 

GGCATCCCCTCTGATCCCCTGGACATGAAGGGGGGCATATTAATGATGCCTTC A 

AGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGTTGTTCTGCTTCCT 

ATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATACCAAGAGC^ 

TTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCT 

ATACCCACCAACACCACTGAAATCATAAGCTATTCACGACTCAAAATATTCT 

TTTCTGACAGTATAGTG 

TATAAATGTGGT CATGTGG TATTTG TAGTTATTGA TTTAAGCATTTTT AGAAATAAGATC 
AGGCATATGTATATATTTTCACACTTOU^^ 

GAATACATATAATATGGTGTAGAAATCATTGAAAATGGATCCTTTTTGACGATCACT 

ATCACTCTGrTATATGACTAAGTAAACAAAAGTGAGAAGTAATTATTGTAAATGGATGGAT 

AAAAATGGAATTACTCATATACAGGGTGGAATTTTATCCrrGTTATCA 

TTATATATTTTCTGAATATCAGCCCCTAATAGGACAATTCTATTTGTTGA 

AATTTGTAAAAG TC CAA TCTGTG CTAACTTAATAAAGTAAT AAT CATCT CI ^ ITIT AAAAA 

AAAAAAAAAAAAAAAAAAAAA 
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Figure 68 

MLWLLFFXVTAIHAELCQPGLAENAJTKV^ 

VPNREATE I SHVLLCNVTQRVS FWFWTDPS KNHTLPAVEVQS AI RMHKNR INNAFFLND 
QTLEFLKI PSTLAPPMDPSVPIWI 1 1 FGVIFCI I IVAIALLILSGIWQRRRKNKEPSEVD 
DAEDKCENMITIENGIPSDPLDMKGGILMMPS 
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Figure 69 

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGA^ 

GGGTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGK3GGCCA 

ATCTATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCT 

CTTTGGCCTATAGCAGCTGTGGAAATTTATAC CTCC CGGGTGCTGGAGGCTGTTAATGGG 
ACAGATGCTCGGTTAAAATGCACTTTCT CCAG ClTl'GCCCCTGTGGGTGATGCTCTAACA 
GTGACCTGGAATTTTCGTCCTCTAGACGGGGGAC CTGAGCAGTTTGTATTCTACTACCAC 
ATAGATCCCTTCCAACCCATGAGTGGGCGGTTTAAGGA 
CCTGAGCGGTACG&TGCCTCCATCCTTCTCTGGAAACTGCAGTT 
TACACCTGCCAGGTGAAGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATC 
AGCGTCGTGCACACTGTACGCTTCTCTCAGATCCACTTCCTGGCTCT 
GCCTGTGCACTGATGATCATT^TAGTAATTGTAGTGGTCCT 

AAGCGATGGGCCGAAAGAGCTCATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGG 

CTCAACCAAGAGAAAAAGGTCTCTGTTTATTTAGAAGACACAGA 

GAAGCTGAGATGATTTCCAAGAACAAGAACCCTAGTATTTCTTC 

TTCTTTGGCTTTTCCAGTTGTGACCCGTTTTC 

AGAC^GCAACACCCCTCTGGAGCCAGCACAGTGCTCCTC 

GCCTCATTATTAAGGTCTTATTTAATTTCAGAGTGTAA 

ATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAGCT 

CTACTTACAGTGTTATGACTTGTATACACATATATTC^ 

TTTGTCTGTTACATTTCCTTTCACGTATTTCTTT^ 

ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCT 

TGTTT CTGATTAACAGTAAATC CTAAATT CAAACT GTTAAATGACATTTTTATTTTTATG 

TCTCTCCTTAACTATGAGACACAT C ITG ITT I ' A CTGAATTTCl'TTCAATATTCCAGGTGA 

TAGATTTTTGTCG 
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Figure 70 

MYGKS S TRA VLLLLG I QLTALWP I AAVE I YT S R VL EAVNG TDARL KCTFS S FAP VGD ALT 
VTWNFRPLIXjGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGT 
YTCQVKNPPDVDGVI GE IRLS WHTVRFS EIHFLALAI GSACALMI 1 1 VI WVLFQHYRK 
KRWAERAHKWEIKSKEEERLNQEKKVSVYLEDTD 



i. 



72 / 133 



(0 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/US00/0S841 



Figure 71 

AAGCAACCAAACTGCAAGCTTTGGGAGTTGTTCGCTGTCCCTGCCCTGCT 

GAGAACGCCAGAGGGAGGCGGCTGGCCCGGCGGCAGGCTCTCAGAACCXjCT 

ATCSCTACTGCTGTGGGTGTCGGTGGTCGCAGCCTTGGCGCTGGCGCT 

GCAGGGGAGCAGAGGCGGAGAGCAGCCAAAGCGCCCAATGTGGTG 

CTGGTCGTGAGCGACTCCTTCGATGGAAGGTTAACATTT^ 

AAACTTCCTTTTATCAACTTTATGAAGACACGTGGG 

AACTCTCC3ATTTGTTGCCCATCACGCGCAGCAATGTGGAGTGGCCT 

ACAGAATCTTGGAATAATTTTAAGGGTCTAGATCCAAACT 

ATGGAGAGGCATGGCTACCGAACACAGAAATTTGGGAAACTGGAC^^ 

CACTCCATTAGTAATCGTGTGGAAGCGTGGACAAGAGATGTTGC1TTC 

GAAGGCAGGCCCATGGTTAATCTTATCCGTAACAGGACrAAAGTCAGA 

GArTGGCAGAATACAGACAAAGCAGTAAACTGGTTAAGAAAGGAAGCAATTA^ 

GAACCA T 1T G1 T A TTTACTTGGGATTAAATTTACCACACCCTTACCCTTCATO 

GGAGAAAATTTTGGATCTTCAACAITTCACACATCTCTTTATTC CTTGAAAAAGTGTCT 

CATGATGCCATCAAAATCCCAAAGTGGTCACCTTTGTCAGAAATGCACCCTGTAGATTAT 

TACTCTTCTTATACAAAAAACTGCACTGGAAGATTT^ 

AGAGCATTTTATTATGCTATGTGTGCTGAGACAGATG CCATG CTTGGTGAAATTATTTTG 
G C C CTT CATCAATTAGATC'l'l'Cl "1*CAGAAAACTATTGTCATATACTCCTCAGAC CATGGA 
GAGCTGGCCATGGAACATCGACAGTTTTATAAAATGAGCATGTACGAGGCTAGTGCACAT 
GTTC CG CTTTTGATGATGGGAC CAGGAATTAAA G C CGG CCTACAAGTATCAAATGTG GTT 
TCTCTTGTGGATATTTACCCTACCATGCTTGATAT 

CTGAGTGGATACTCTTTGTTGCCGTTATCATCAGAAACATTTAAGAATGM 

AAAAACCTGCATCCACCCTGGATTCTGAGTGAATTCCATGGATGTAATGTG 

ACCTACATGCTTCGAACTAACCACTGGAAATATATAGCCTATTCGGAT^ 

TTGCCTCAACTCTTTGATCTTTCCTCGGATCCAGATGAATTAACAAATC 

TTTCCAGAAATTACTTATTCTTTGGATCAGAAGCTTCATO 

bTUTLlX^lTCTGTCCACCAGTATAATAAAGAGC^ 

GGACAGAATTATTCAAACGTTATAGCAAATCTTAGGTGGCACGAAGACT 

CCAAGGAAGTATGAAAATGCAATTGATCAGTGGCTTAAAACCCATA 

GTTTTQAACAAAAAGTTTAAAAATAGTGTTCTAGAGATACATATAAATATACT 

ATAATTATGTATTTTAAATGAAACAGTTTTAATAATTACCAAGTTTTGG CCGGGCACAG T 

GGCTCACACCTGTAATCCCAGGACTTTGGGAGGCTGAGG 

AGATTGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAA^ 

AGCTGGGCGCGGTGGTGCACACCTATAGTCTC^GCTACT 

CGCTTGAACCCGGGAGGCAGCAGTTGCAGTGAGCTGAGATTGCGCC^ 

TGGCAACAGAGTGAGACTGTGTCGrCAAAAAAATAAAAATAAAATAATAATAATTACCAAT 

TT1TCATTATTTTGTAAG AATG TAGTGTATTTTAAGATAAAATGCCAATGATTATAAAAT 

CACATATTTTCAAAAATGGTTATTATTTAGGCCTTTGTACAA 

AAGTATCAAAAGGATTCAAGCAAATACTGTAACAGTTATGTTC 

TATAAAATATTGTAATAATATGTATCATAAAATAGTTGTATGTGAGCATTTGA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 72 

MLLLWVSWAALALAVLAPGLMSEQRRRAAK^ 
NFMKTRGTSFLNAYTNSPICCPSRAAW^ 

YRTQKFGKLDYTSGHHS I SNRVEAWTRDVAFLLRQEGRPMVN1»I ENRTKVRVMERDWQNT 
DKAVNWLRKEAINYTEPFVIYLGLNLPHPYPS 

IPKWSPLSEMHPVDYYSSYTKNCTGRFTKKEIKNIRAFYYAMCAET^^ 
DLI^CTIVIYSSDHGELAMEHRQFYKMSMYEAS^ 

YPTMLD LAG I PLPQNLSGYS I»L PL S SETFKNEHKVKNLHP PW I LSEFHGCNVNAS TYMLR 
TNHWKY I AYSDGAS IL PQLFDLSSDPDELTNVAVKFPE I TYSLDQKLHS I INYPKVSASV 
HQYNKEQF I KWKQS I GQNYSNVI ANLRWH QDWQKEPRKYENAI DQWLKTHMNPRAV 
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Figure 73 

GACGCCCAGTGACCTGCCGAGGTCGGCAGCACAGAGCTCTGGAG 

ATOAAGACCCTGTTCCTGGGTGTCACGCTCGGCCTGGCCGCTGCCCTGTC 

GAGGAGGAGGATATCACAGGGACCTGGTACGTGAAGGCCATGGTGGTC^ 

CCGGAGGACAGGAGGCCCAGGAAGGTGTCCCCAGTGAAGGTGACAGCCCTGGGCGGTGGG 

AAGTTGGAAGCttCGTTCACCTTCATGAGGGAGG^ 

ATGCGGAAGACGGAGGAGCCTGGCAAATACAG CGC CTATGGGGGCAGGAAGCTCATGTAC 

CTGCAGGAGCTGCCCAGGAGGGACCACTACATCTTTTACTG CAAAGACCAGCACCATGGG 

GGCCTGCTCCACATGGGAAAGCTTGTGGGTAGGAATTCrGATACCAACCGGGA 

GAAGAATTTAAGAAATTGGTGCAGCGCAAGGGACTCTCGGAGGAGGA 

CTGCAGACGGGAAGCTGCGTTCCCGAACZACTAGGCAGCCCCCGGGTCTGCACCTCCAGAG 

CCCACCCTACCACCAGAttC^GAGCCCGGACCACCTGGACCniCCCTC 

TTCCCTCCTCCCACCCACCTGACTCOUUITAAAGTCCTTTTCCC 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 74 

MKTLFIXTVTLGKAAALSFTLEEED^ PVKVTALGGG 
KLEATFTFMREDRCI QKKI L^4IUCTEEPGKy SAYGGRi^*MVT#QELPRRDHy I FYCKDQHHG 
GLLHMGKLVGRNSDTNREALEE FKKLVQRKGLSEEDI FTPLQTGS CVPEH 
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Figure 75 

CTGGGATCAGCCACTGCAGCTCCCT 

ATt^GGAGKaCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTG 

GTCCCAGCACCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGAC 

CC^GAGAAGGCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAG^ 

GTGCTGTTCCCTGTCCAGAAGCCGAAACTCTTGACC^^ 

GGCAGGGGCCCCATCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGC 

CGTGTCCTGAGTCCCGAGCCCGACCATCACAGCCTGTACCACCCTCCGCCTGAGGAGGAC 

C1AGGGCGAGGAGAGGCCCCGGTTGTGGGTGATGCCAAATCACCAGGTC 

GAGGAAGACCAAGACCACATCTACCACCCGCAGTAGGG CTCCAGGGGCCATCACTGCCCC 

CGCCCTGTTCCCAAGGCCCAjGGCTGTTGGGACTGGGACCCTCCCT^ 

ACAAATAAACCCCAGCAGGCAAAAAAAAAAAAAAAAAAA 
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Figure 76 

MRRLLLVTSLVVVLLWEAGAVPAPK\^I KMQVKHWPS EQDPEKAWGAR WEP PEXDDQLV 
VLFPVQKPKLLTTEEKPRGQGRGPILPGTKAW>fETEDTI*GRVLS PE PDHDS LYHPPPEED 
QGEERPRLWVMPNHQVLLGPE EDQDHI YHPQ 



r 

X 
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Figure 77 

GGGAGAGAGGATAAATAGCAGCGTGGCTTCC CTGGCTCCTCTCTGCATCCTTCCCGACCT 

TCCCAGCAATATGCATCTTGCACGTCTGGTCGGCTCCTGCTCCCTCCTTCTC 

GGCCCTGTCTGG 

ATGGGCGGCCAGCGATGACCCCATTGAGAAGGTCATTGAAGGGA 

CAATGCAGAGAGAGAGGTGGGCAAGGCCCTGGATGGCATCAACAGTGGA^ 

CGGAAGGGAAGTGGAGAAGGTTTTCAACGGACTTAGCAACATGG^ 

GGAGTTGGACAAAGGCGTCCAGGGGCTCAACCACGGCATGGACAAGGTTC 

CAACCATGGTATTGGACAAGCAGGAAAGGAAGCAGAGAAGCTTGGCCAT^^ 

CGCTGCTGGACAGGCCGGGtfUUSGAAGCAGACS^ 

CCACCAGGCTGGGAAGGAAGCAGAGAAACTTGGCCAAGGGGTCAAC 

GGCTC^AAAGGAAGTGGAGAAGCTTGGCCAAGGTGCCCACCATC 

GAAGGAGCTGCAGLAATGCTCATAATGGGGTCAACCAAGC 

GCTGAATGGCAACCATCAAAGCGGATCTTCCAGCCATCAAGGAGGGGCCA 

GTTAGCCTCTGGGGCCTCAGTCAACACGCCTTTCATCAAC 

CGTCGCCAACATCATGCCCT^ACTGGCATCCGGCCTTGCTGGGAGAATAATGTCGCCGT 

TGTCACATCAGCTGACATGACCTGGAGGGGTTGGGGGTGGGGGACAGGTTTCT 

CTGAAGGGGGTTGTACTGGGATTTGTGAATAAACTTGATACACCA 
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Figure 78 

MHLARL VGS CS LLLLLGALS GWAASDDP I EKVT EGINRGLS NAER EVGKALDGINSG I TH 
AGREVEKVFNGLSNMGSHTGKEIiDKGVQGLNHGMDKVAHE 

NAAGQAGKEADKAVQGFHTGVHQAGKEAE KXGQGVNHAADOAGKEVEKLGQGAHHAAGQA 

GKELQNAHNGVNQASKEANQLl^G^QSGSSSHQGGATTTPIJ^ 

SVANIMP 
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Figure 79 

GGACAACCGTTGCTGGGTCTC^ 

CCTTTCGGCCnTGAGGTTCCCAGCCTGGTGGCCCC^^ 

TGCTACGGACGACGCCT 

ATCAAGCCCTTAGTCCTTCTAGTTGCGCTTTTGCTATGGCCT 

CCGAGCATAACTGTGACACCTGATGAAGAGCAAAACCT 

GAGAACCTAGTACGAAGTGTTCCCTCTGGGGAGCCAGCT 

CCAAAACATGTTTATTCTATAGCATCAAAGGGATCAAAAT^ 

GGAGACGCTTCAACTGAGAATGATGTTTTAACCAATCC^^ 

TTCCCTACAGGAGGCTTCACACCGGAAATAGGAAAGAAAAAACACACGGAAAG 

TTCTGGTCGATCAAACCAAACAATGTTTCCATTGT^ 

GAAA.ATGAAGAGCCAGAGCCAGAGC CGGAGCCAGCTGCAAAACAAACTGAGGCACCAAGA 
ATGTTGCCAGTTGTTACTGAATCATCTACAAGTC^ 

GTCACCACTTTAGATAAGAGCACTGGCATTGAGATCTCTACAGAATCAGAAG^ 

CAGCTCTCAGGTGAAACTGCGATAGAAAAACCCGAAGAGTTTGGAAAGCA 

TGGAATAATGATGACATTTTGAAAAAAATTTTAGATATTAATT» 

CTT CTTAGTGACACCAGCAACCCAG CATATAGAGAAGATATTGAAG CCTCTAAAGATCAC 

CTAAAACGAAGCCTTGCTCTAGCAGCAGCAGCAGAACATAAATTAAAAAC^ 

TCCCAGTTATTGCCAGTAGGACGAACAAGTAATAAAATTGATGACATCGAAACTC 

AACATGCTGTGTAATTCTAGATCTAAACTCTATGAATATTTAGATATTAAAT^ 

CCAGAGATGAGAGAAAAAGCTGCTACAGTATTCAATACATTAAAAAATATGTGTAGATC^ 

AGGAGAGTCACAGCCTTATTAAAAGTTTATTAAACAATAATATAAAAATTTt 

TTGATATTCCATAAC^AAGCTGATTTAAGCAAACT 

ATATTCGTAATTTCAAAAGTTGTATAAAAATATTTTCTATTC 

ATCTTTATGTGTCATGTGTTATGAACAATTTTCATATC 

AAAATTTTGGTTCAGGAAAAAA 
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Figure 80 

MKPLVLLVALLLWPSS VPAYPS I TVTPDEEQNLNHYI QVLENLVRSVPSGEPGREKKSNS 

PKHVYS IASKGSKFKELVTHGDASTEiroVLTNPISEETTTPPTGGFTPEIGKKKHTESTP 

F^SIKPNOTSIVI^EEPyiENEEPEPEPEPAAKQTEAPRmPVVTESSTSPYVTSYKSP 

VTTLDKSTGIEISTESEDVPQLSGETAIEKPEEFGKHPESWNNDDILKKILDINSQVQQA 

LI^DTSNPAYREDIEASKDHLKRSLALAAAAEHKLI^^ 

NMLCNSRSiQ*YEYLDI KCVP PEMREKAATVFNTLKNMCRSRRVTAI*LKVY 
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Figure 8! 

AAAGTTGAGAGCAGC 

AT tf i ' lTlt^ CCfrCTGAA&CTCATCCTGCT 

AATGACTTGAATOITTCCCCGCCTGAGCTAACA 

GGATGTGTTTTCC3VGAGCACAGAAGACAAATGTATATTCAAGA 

CCAGGAGAGCACGCCAAGGACGAATATGTGCTATACT^^ 

ATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATC 

CTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACC^^ 

AAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCC^ 

AAAGAGCTCATGGTC CATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAGAGCACA 

GAAGTGAAACACGTGACCAAGGTAGAATGGAT ATTTTCA GGACGGCGCG 

ATTGTATTTCGTTACTACCACAAA CTCAGGATGT CTGTGGAGTACTC CCAGAG CTGGGGC 

CACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATC 

CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCA 

CTGGTGTTCAAGAAAACCATTGTGCTGCATGT^ 

T CAGTTGGTGATCAl"lXaTGGGAAl"lXJ , rClXiltjKrCACAATCCTG CTG CTC C CTG '1TCTG AT 

ATTGATCGTGAAGAAGACCTGTCGAAATAAGAGTTCAGTGAATTCT 

GAACACGAAGAAGACTAATCCAGAGATAAAAGAAAAACCCTGCCATTTTGA 

AGGGGAGAAACACATTTACTCCCC^TAATTGTACGGGAGGTGATCGAGGAAGAAGAACC 

AAGTGAAAAATCAG&GGCCACCTACATGACCATGCACCCAGTTTG^ 

AGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGGGAATGCCAAAAAC^ 

CTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAG 

TGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTCCCAGCTGTCCrCCTCT 

CATTGTTTGGT*CAATACACTGAAGATGGAGAATTTGGAG CCTGGCAGAGAGACTGGACAG 

CTCTGGAGGAACAGGCCTCCTGAGGGGAGGGGAGCATGGACTTGGCCT 

ACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTTGG^ 

CCTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATAAAAACCAA 

CAAATCAA 
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Figure 82 

KFCPLKLIIJjPVIJ^YSLGLNDLOTSPPELTVH^ 
PGEHAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCND^ 

KGESQVFXKAVVLHVLPEEPKELMVHVGGLIQ^ FSGRRAKEE 
I VTR YYHXLRMSVEYSQS WGHFQNR VNLVGD I FRNDGS I MLQGVRES DGGNYTCS I HLGN 
LVFKKTIVI^SPEEPRTLVTPAALRPLVIXSGNQLV^^ 

GNKSSVNSTVLVKNTKKTNPEI KEKPCHFERCEGEKHI YS P 1 1 VREVI E EEEPSEKSEAT 
YMTMHPVWPSI«RSDRMNSLEKXSGGGMPKTQQAF 
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Figure 83 

GTCGAAGGTTATAAAAGCTTCCAGCC 

CACAGCCTGAGATCTTGGGGATCCCTCAGCCTAACACCCACAGA.CCTCAGCT 
CCCGCTGCATCAAGGCCTACCCACTGTCTCC 

CAGCCCTTGGCCCCTCAAGACTTTGAAGAAGAGGAGGCAGATG 

CCGCCTTTGCCGGCTGTCCCCTGCGACTACGACCACTGCCGACACCTGCAGGTGC^ 

AAGGAGCTACAGAGGGTCGGGCCGGCGGCCTGCCTGTGC^ 

CAGCCGCCCGACCCGCCGCGCATGGGAGAAGTGCGCATTGCGGCCGAAGAGGGCCGCGCA 
GTGGTCCACTCGTGTGCCCCCTTCTCCCCGGTCCTCCACT 

GGCAGCGAGGCTGCGCAGAAGGGGCCCCCGCTGAACGCTACGGTCCGCAGAGCCGAA^ 
AAGGGGCTGJUU^CAGGGGGCATTTATGra 

GCAAGCCGCGTGCCCCAGGCTGGAGGAGAGGGCCTCGAGGGGGCCGACATCCCTGCCTTC 

GGGCCTTGCAGCCGCCTTGCGGTGCCGCCCAACCCCCGCACTCTGGTCCACGCGGCCGTC 

GGGGTGGGCACGGCCCTGGCCCTGCTAAGCTGTGCCGCCCTGGTCTGGCACTTCT 

CG CGATCGCTGGGGCTGC CCGCGCCGAGCCGCCGCCCGAGCCGCRGGGGCGCTCTGAAAG 

GGGCCTGGGGGCATCTCGGG»CAGACAGCCCCACC^^ 

GAAAGAGGAAAACCCGCTGCCTCCAGGGAGGGCTGGACGGC^^ 

GGCTCCAGGGCCACGGCGGAGTCATGGTTCTCAGGACTGAGCGCTTGTTT^ 

CTTGGCGCTTTGTTTCCTGGCTGAGGTCrGGGAAGGAATAGAAAGGGGCCCCCA A T T ITT 

TTTTAAGCGGCCZAGATAATAAATAATGTAA.CCTTTGCGGTTAAAAAAAAA 
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Figure 84 

MLGSPCLLWLIAVTFLVPRAQPIAPQDFEE 

KEI^RVGPAACLCPGLSSPAQPPDPPRMGEVRIAAEEGRAVraWCAPFSFVLHYWI^ 
GSEAAQKGPPLNATVRRAELKGLKPGGI YWCWAANEAGASRVPQAGGBGLEGADI PAF 
GPCSRIAVPPNPRTLVHAAVCTGTMJ^L^ 
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Figure 85 



CGACG 




TCGTCGCTCArcCCCTATTTCGGCACC^ 

TTGTCGGGCCCCGAGGCTCCGTGGCGGGACCCTGAGCTGCTGGAGGGGACCTGCACCCCG 

GTGCAGCTGGTCGCCCTCAfTCGCCACGGCACCCGCTACCCCACGG 

AAGCTGAGGCAGCTGCACGGGTTGCTGCAGGCCCGCGGGTC CAGGGATGGCGGGGCTAGT 

AGTACCGGCAGCCGCGACCTGGGTGCAGCGCTGGCCGACTGGCCTTTGTC 

TGGATGGACGGGCAGCTAGTAGAGAAGGGACGGCAGGATATGCGAC^ 

CTGGCCTCGCTCTTCCCGGCCCTTTTCAGCCGTGAGAACTAro 

ACCAGTTCCAAGCACCGCTGCATGGATAGCAGCGCCGCCTTCCTGCAGGGGCTG^ 

CACTACCACXTCTGGCTTGCCGCCGCCGGACGTCGCAGATATGGLAGTTTG 

G TTAATGATAAACTAATGAGATTITTTGATCACTGTGAGAAG'X' T V1TAACTGAAGTAGAA 

AAAAATGCTACAGCTCTTTATCACGTGGAAGCCTTCAAAACTGGACCAG 

ATTTTAAAAAAAGTTGCAGCTACTTTGCAAGTGCCAGTAAATGATTTAAATC 

ATTCAAGTAGCCTTTTTCACCTGTTCATTTG&^ 

TGGTGTGATGTITTTGACATAGATGATGCAAAGGTATTAGAAT^ 

CAATATTCGAAAAGAGGATATGGGTATACTATTAAGACT 

CAGGATATCTTTCAG CACTTGGACAAAG CAGTTGAACAGAAACAAAGGTCTCAGC CAATT 

TCTTCTCCAGTGATCCTCCAGTTTGGTCATGGAGAG&CTCCT 

ATGGGCTACTTCAAAGACAAGGAACCCCTAACAGCGTAC^ 

CGGAAGTTCCGAAGTGGTCTCATTGTACCTTATGCCTCGAACCTGATATTT^ 

CACTGTGAAAATGCTAAGACTCCTAAAGAACAATTCro 

AAGGTGTTACCTTTGGCTTACTCACAAGAAACTGTTTCATTTTATC 

CACTACAAGGACATCCTTCAGAGTTGTCAAACCAGTG 

AACAGTACATCTGATGAACTATGAGTAACTGAAGAACA Tri" 

CAATGAGTGATT7VCATGCTTGTAATAGGTAGGCAATTC 

ATTACTTGAGTATTTCTGTCTTTTCACAGAAAAACA1TGGGTTTCTCT 

ATGAAATGTAAGAAAAGATTTTTCACTGGAGCAGCTCTCTTAAGGA 

TAGAGAAACAGCTGGCCCTGCAAATGTTTACAGAAATGAAATTCTT 

AAATCTCACACTGAGATAGAATTGTGATTTCATAAT^ 

CAAAATATCTCAGTTGGACCATCCTTAACTTGATTGAACT 

GTTCTGCAGTTCTCTCTTCTTTTCCTCAGGTAGGACA 

AATATTGTGGTAAGCTGGGAGTATCACTCTGGAAGAAAGTAACATCTC 

TGAAACAAGAAACAGAGTGTTGTAAAAGGACACCTTCACTGAAGCAAGTCG 

ATGAAAATAAATAT1T1^\^TATTI7\TTTATGA^ 

Cl"l"ri"l'ACITCTAGGAAGTCrCAAAAGACCATCTTAAAT^ 

GCAACAAGTCAGATAGTTAGAATCGAAGTTTTTCAAATCCATTGOT 

ATTCTGTCACTTGGCTTCGATTTTTATATTTTCCTACT 

GTTTGATTTTTCTTTCTTTCT^ 

ATTTGCTATAATAAAGAAAATTCTTGTGACTTTAAAAAAAAA 
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Figure 86 

MLRAPGCLLRTSVAPAAALAAALLSSLARCS LLEPRDPVASS LS PYFGTKTRYEDVNPVL 

LSGPEAFVnUDPELLEGTCTPVQLVALIRHGTRYPTVKQIR^ 

STGSRDI/SAAIADWPLWYJUWMIX^ 

TSSKHRCMDS SAAFLQGLWQHYHPGLPPPDVADMEFGP PTVNDKLMRFFDHCEKFLTEVE 
KNATALYHVEAFKTGPEMQNILKKVAATLQVPVNDLNADLIQ^ KGVXSP 
WCD VFDI DDAKVLEYLNDLKQ YWKRGYGYT I NS RSS CTLFQD I FQHLDKAVEQKQRSQPI 
SS PVILQFGHAETIiLPLLSI^GYFKDKEPLTAYNYKKQMHRKFRSGL I VPYASNLI FVL Y 
HCENAKTPKEQFRVQMU^KVLPIAYSQE^ 
NSTSDEL 
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Figure 87 

GGGACTACAAGCCGCGCCGCGCTGCCGCTGGCCCCTCAGCAACCCTCGAC 
ATGGCGCTGAGGCGGCCACCGCGACTCCGGCTCTG CG 
CTGCIXXrrTTTCAGGGGCTGCCTGATAGGGGCTGTAAAT 
CCAGTGGTACAGGAATTTGAAAGTGTGGAA 

CTGTCTTGCATCATTACGGATTCGCAGACAAGTGACCCCAGGATCGAGTGGAAGAAAATT 
CAAGATGAACAAACCACATATGTGTTTTTTGAC^ 

CGTGCAGAAATACTGGGGAAGACATCC CIX3AAGATCTGGAATGTGACACGGAGAGACTCA 

GCCCTTTATCGCTGTGAGGTCGTTGCTC^^ 

ATO^GTTAACTGTGCAAGTGAAGCC^GTGACCCCTC^ 

CCAGTAGGCAAGATGGCAACACTGCACTGCCAGGAGAGTGAGGGCCACCCCCGGCCTCAC 

TACAGCTGGTATCGCAATGATGTACCACTGCCaiCGGATTCCA^ 

CGCAATTCTTCTTTCCACTTAAACTCTGAAAC^ 

AAGGACGACTCTGGGCAGTACTACTGCATTGCTTCCAATGACGCAGGCT 

GAGGAGCAGGAGATGGAAGTCTATGACCTGAACATTGGCGGAATTATT^ 

GTTGTCCTTG CTGTACTGGC CCTGATCACGTTGGGCATCTGCTGTGCATACAGACGTGGC 

TACTTCATCAACAATAAACAGGATGGAGAAAGTTACAAGAACCCAGGGA^ 

GTTAACTACATCCGCACTGACGAGGAGGGCGACTTCAGACACAAGTCATC ^ 

TGAGACCCGCGGTGTGGCTGAGAGCGCACAGAGCGCACGTGCACATACCTCTGCT 

CTCCTGTCAAGGCAGCGAGAGCTGATGCACTCGGACAGAGCTAGACACTC 

TTTTCGTTTTGGCCAAAGTTGACCACTACrCTTCTTACT 

AGAATTTTCCTCAAGATGGACC CGGTAAATATAACCACAAGGAAGCGAAACTGGGTGCGT 

TCACTGAGTTGGGTTCCTAATCTGTTTCTGGCCTGATTCCCGCATGACT 

CTTAAAGAGTTTGCTCACGTAAACGCCCGTGCTGGGCCCTGTGAAGCCAGCATGT^ 

ACTGGTCGTTCAGCAGCCACGACAGCACCATGTGAGATGGCGAGGTGGCTGTO 

AGCAGCGCATCCCGGCGGGAACCCAGAAAAGGCTTCTTACACA 

GCAGCCTTACTTCATCGGCCCACAGACACCACCGCAGTTTCTTCTTAA^ 

TCGGTGTTGCAGTGTCCATTGTGGAGAAG C rr TTTGGATCAGCATTTTGTAAAAACAACC 

AAAATCAGGAAGGTAAATTGGTTGCTGGAAGAGGGATCTTC 

CCAACAGGGTGTC^GGATTTAAGGAAAACCTTCGTCTO 

GAAATATGCTTTTCTATGGGTCTTGTTTATTTTATAAAA 

AAGGATGTATTTTGATTATTGAAAAGAAAATTTCTATTTA^ 

ACAATGTTAAATAACCTATTTTTTTAAAAAAGTTC 

CTAGTGTTAAATTGGAAAATATCAATAATTAAGAGTA^ 

GGAAGTTTACTGTGATGTTCCTTTTCTCACACAAGTTTTAGCCTTTI^ 

CATACTGTCTACACATCAGACCATAGTTGCTTAGGAAACCTTTAAAAATT 

CCAGTTAAGCAATGTTGAAATCAGTTTGCATCTCTTCAAAAGA^ 

TTGAACTCCCTCTTCCTGAGATGACTAGGACAGTCTGTACCC^ 

CTCAGATGTACATACACAGATGCCAGTCAGCTCCTGGGGTTGCGCCAGGCGCCCCCGCTC 
TAGCTCACTGTTGCCTCGCTGTCTGCCAGGAGGCCCTGCCATCCTT^ 
GCTGTGTCCCAGTGAGCTTTACTCACGTGGCCCTTGCTTCATCCAGCACAGCTCTCAGGT 
GGGCACTGCAGGGACA 

CTGGTGTCTTCCATGTAGCGTCCCAGCTTTGGGCTCCTGTAACAGACCTCTTTT^ 

TGGATGGCTCACAAAATAGGGCCCCCAATG CTATTTTTTTTTTTTAAGTTTGTTTAATTA 

ITTGTT A AGATTGTCTAAGGCCAAAGGCAATTGCGA 

CATTTTTAAAAGAAAATGGATCCCACTGTTCCTCTTTGC'CA 

CCACAGGCTCTGTCGCATTTCAAAACAAACCATGATGGAGTGGCGGC 

TAAAGAACGTCAGGTGGAG CAG C CAG GTGAAAGG C CTGG CGGGGAGG AAAG TGAAACGCC 

TGAATCAAAAGCAGTTTTCTAATTTTGACTTTAAATTTTTCATC 

GTTCTCAGCCTCACATGCrCTGCCGTGCTGGAC^ 
AGCTGCTGAGAAGGAGCACTCCACTGTGTGCCTGGAGAATGGCT 

TCTITCAGCTTCCAG TGTCTTGGGTTTTTTATACTTTGACAG CTTTTTTTTAATTG CATA 
CATGAGACTGTGTTGACTTTTTTTAGTTATGTGAAACACTTTGCCGCAGGCC 
GAGGCAGGAAATGCTCCAGCAGTGGCTCAGTGCTCCCTGGTGTCTGCTGCATGGCATCCT 
GGATGCTTAGCATGCAAGTTCCCTC CZATCATTGCCACCTTGGTAGAGAGGGATGGCTCCC 
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CACCCTCAGCGTTGGGGATTCACGCTCCAGCCTCCTTCTTGGTTGTCATAGTGATAGGCT 
AGCCTTATTGCCCCCTCTTCTTATACCCTAA 

GGTCTGAAAAAGTAGAGAGAAGTGAAAGTAGAGTCTGGGAAGTAGCTC 
ACTAGACGGAAAAGGAATACTCGTGTA1TTTAAGATATGAATGTGACT 
CCGATACGAGGCTGTGATTCTGCCTTTGGATGGA^ 

TTGTACTAACACACCGTAATTTGGCATTTGTTTAACCTCATTTATAAAAGCTT 
ACCCA 
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Figure 88 

MALRRPPRLRLGMILPDFFI^I^FR^ ITDSQT 
SDPRIEWKKIQDE^TTYVFFDNKIQGDIAGRAEILGK^ 
NDRKSIDEIVIELTVQVKPVTPVCRVPXAVPVGKHA.TIaH 
PTDSRANPRFRNSSFHLNSETGTLVFTAVHKDDSGQYYCIASOT 

NIGG 1 1 GGVLVVLAVUVLI TLGI CCAYRRGYFI NNKQDGES YKNPGKPDGVNYI RTDEEG 
DFRHXSSFVI 
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Figure 89 

CCCACGCGTCCGGACAAACTGGAGGTGAAAGGAGCTGGTACTGTCCACTGT^ 

GCTGAACCTGAGACGCGAGCGGACCAG 

ACCCTCTGTGCCCTCTGGGACCCGCTTCACCAAC^^ 

GT GTGG TATCG GACGTG GGAGAGAGTCCTCTGTTTGCCACCTG 

GACTTTGGAGATra 

CTTTTG AGAGT TTTTrmTTATTGTGATTTATATT^ 
GGGGGACTTTTGTGACTGTTAAAATAAGGTGAAAAGCAATAAGG 
ATGTTTAAGTCCTGGTCAGTTGTCTTGGTTCTCGGATT 
AGGCCAACCAAAGAAGGAGGATATGGCCTTAAATCCTATC^ 

C ATAAG CAGGAAAAAAATCAAGAAAGTTCAAGAGTCAAAGGATTTATGATT 

CCTITTGGATCTTGTGAAAATAAGTACTGTGGTTTG^ 

GAGAC^GGGCAAGCAGAATGTGCCTGTATGGACCTTTC 

TGTGGATCTGACGGAGAATTCTATGAAAAC CACTGTGAAGTGCACAGAGCTG CTTGCCTG 

AAAAAACAAAAGATTACCATTGTTCACAATGAAGACTGCTT CTTTAAAGGAGATAAGTG C 

AAGACTACTGAATACAGCAAGATGAAAAATATGCTATTAGATTTACAAAATCAAA^ 

ATTATGCAAGAAAATGA AAAT CCTAATGGCGACGACATATCTC 

GATCAAATGTTTAAATATTTTGATGCAGACAGTAATGGACTTGTAGAT^ 

ACTCAGGTGATAAAACAGGAAGAACTTGGCAAGGATCTCTTTGATTGTACT 

CTATTGAAATATGATGATTTTAATGCTGAC^ 

GCATTCCAAGTGATCCAGTTGAGTCTGCCAGAAGATCAGAAACTAAGCATCACTGCAGCA 

ACTGTGGGAC^AAGTGCTGTTCTGAGCTGTGCCATTCAAGG^CC 

ATCTGGAAAAGGAACAATATTATTCTAAATAATTTAGATTTGGAAGACAT 

GGAGATGATGGGTCCTTGTATATTACTAAGGTTACCACAACTCACGTTGG CAATTACACC 

TGCTATGCAGATGGCTATGAACAAGTCTATCAGACTCACATCTTCCAAGTGAA 

CCAGTCATCC 
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Figure 90 

MFKCWSV\^VLGFIFLESEGRPTKEGGYGLKS^ 
PraSCENKYCGIX^CVTSRBTGCAECACMD^ 

KKQKI TI VHNBDCFFKGDKCKTTEYS KMKl^MLLDLQNQKYIMQENENPNGDDI SRKKLLV 

DQMFKYFDADSNGL VD I NELTQ VI KQEEI<GKDLFDCTLYVlJ*KYDDFNADKHLALBEFyR 

AFQVIQI^LPEI^KI^ITAATVGQSAVLSC^ 

GDDGSLYI TKVTTTH VGNYTCYADGY EQ VYQTH I FQVNVPFV7 . . . 



93 / 133 



(C) 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/US00/O5841 



Figure 91 



GCAGCGAGCGCCGGGTGCGGCCCTGCCGCCGCAGGGATGTGACCTTCACCGTCGCTT^GC 
CAGGATG^CCGGAGCCCGTCTCTCGCGGCGTCCGCGCCTCGCTT 

CTGACCGCACGCTCCCGGCTGCTAGGCTCCCCGGCACCGGCCTCGCC 

A^CGCCACCGCCCGGGCCCGCCGCCGCCCTGGGCACTGCGCTTCTGCTGCTCCTG 

GCTTCCGAGTCTTCTCACACTGTGCTGTTGCGGGCGCGTGAGGCGGCGCAGTTTCTGCGG 
CCCAGGCAGCGCCGC 

GCCTACC^GTCTTCGAGGAGGCCAAGCAGGGCCAC^^ 

GTGTGCAGCAAAGAGGAGGCCAGAGAGGTGTTCGAGAACGACCCCGAGACGGAGTATTTC 
TATCCACGATATCAAGAGTGCATGAGA/^AATATGGO^GGCCTGAAGAAAAAAACCCAGAT 
TT CGCCAAATGTGTTCAGAACTTG CCTGAC CAG TG CAC CC CAAACC CTTGTGATAAGAAG 
GGTACTCATATCTGCC^GACCTCATGGGCAACTTOTCT 

GGAGGCCGGCTCTGTGACAAAGATGTCAATGAGTGTGTCCAGAAGAATGGGGGCT 

CAGGTCTGCCACAACAAACCAGGAAGCTTCCAATGTG CCTGCCATAGTGGCTTCTCGCTT 

GCATCAGACGGCCAG^CCTGCCAAGATATCGATGAATGCACAGACTCAGIACACCTGT^ 

GACGCGCGATGCAAGAACTTGCCAGGCTCCTACTCTTGCCTCTGCGATGAGGGATATACA 

TACAG CTCCAAGGAGAAGACCTGCCAAGATGTGGACGAGTGCCZAGCAGGATCGCTGTGAG 

CAGACCTGTGTCAACTCCCCAGGCAGCTATACCTGCCACTGTGATGGGCGAGGGGGCCTA 

AAACTATCCCCAGACATGGATACTTGTGAGGACATCTTACCATGTC 

GCCAAGAGCGTGAAGTCCTTGTACCTGGGCCGCATGTTCAGCGGGACCCCCGTGATTAGA 
CT^CGCTTC^GAGGCTTCAGCCTAC^ 

GACCCTGAAGGAGTCCTCTTCITCGCTGGAGGCCGTTCAGACAGCACCTGGATTGTC 

GGCCTAAGAGCTGGGCGGCTTGAGCTGCAGCTTCGGTACAATGGCGTTGGGCGCATCACC 

AGCAGCGGGCCAACCATCAACCACGGCATGTGGCA^CTATCTCCGTGGAAGAGCTGGAA 

CGTAACCTTGTCATCAAGGT CAACAAAGATGCTGTAATGAAGATCG CGGTAGCTGGGGAG 

CTGTTTCAGCTGGAGAGK3GGCCTCTATCACCTGSAATCTCACCGTGGGCGGCATTC 

AAGGAGAGTGAGCTCGTCCAGCCGATTAACCCTCGCCTGGATGGGTGCATGAGGAGTTGG 

AACTGGCTGAACGGGGAAGACAGCGCCATCCAGGAGACAGTCAAGGCAAA^ 

CAGTGCTTCTCTGTGACAGAAAGGGGCTCCTTCTTCXCGa 

AGGCTCAACTACACCCGAACATCGCTGGATGTCGGCACGGAAACCACCTGGGAAGTTAAA 

GTTGTGGCTCGGATCCGCCCTGCCACGGACACGGGGGTGCTGCTGGCGCTGGTGGGGGA 

GACGATGTCGTCATCTCTGTGGCCCTAGTCGACTACCACTCTACAAAGAAGCTCAAGAAG 

CAGfTTGGTGGTCCTGGCAGTTGAGGATGTTGCCCTGGCACTGATGGAAATCAAGGTGTGC 

GACAGCCAGGAACACACGGrCACTGTCTCCCTGCGGGAGGGTGAGGCCACCCTAGAAGTG 

GATGGCACAAAGGGCCAGAGTGAAGTGAGCACTGCCCAGCTGCAGGAGCGACTGGACACA 

CTTAAGACACATCTGCAAGGCTCTGTG»CACCTATGTTGGAGGCCTG 

GTGATTTCTGCACCCGTCACTGCGTTCTACCGCGGATGCATGACTCTGGAGGTAAACGGG 

AAAATCCTGGACCTGGATACGGCCTCGTACAAGCACAGTGACATCACCTCCCACTCC^ 

CCGCCTGTGGAGCATGCCACCCCCT^CCGAGCTGCAAGAGGGCTCCACAC 

AAAAATGAAGCAGGGTTTGGACACACAGCACTGGCTCCTCTCGCZATGGTCCTGCAACACT V 
<^GCAGa7rGGACCGCCCTTGTGG ^ 

ACATATCTGTACATAATGGACGGGTGTCGGGTCACCGGCTGCTC 

CCTGGTGGGAGCTGGCTGGAAGGGGCTGGGCTAGAGGGGCTGGCAGTTTGCAGCAGAACX5 

GATGTGAAG71AAATAATTCTCTATTATTTTTATTACTACATGCTTCTTTCTC 

ATATGGAAAATAAAATATTTACAGAAACCTTTTTAAAAAAAAAAAAAAAAA 
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Figure 92 

MPFPPGPAAALGTA T J J il >TiT iRSESSHTVLIJlftilEAAQFLRPRQRRAYQVPBBRKQGHLE 
RBCVEEVCSKESAREVFENDPETEnfFYPRYQEC^ 

PNPCDKXGTHI CQDLMGNFFCVCTDGWGGRliCDKDVNECVQKNGGCSQVCHNKPGS FQC 
ACHSGPSLASDGQTCQDI DECTDSDTCGDARCKNLPGSYSCLCDEGYTYSSKE1CTCQDV 
DEOQQDRCEQTCVNSPGS YTCHCDGRGGLKLS PDMDTCEDILPCVPFSMAKSVKSLYLG 
RMFSGTPVTRLRFKRLQPTRIJIAEFDFRTFDPEGVLFFAGGRSDSTWIV^ 
QLRYNGVGR ITSSGPTI NHGMWQTI S VEELERNLVT KVNKBAVMKIAVAGELFQLERGIj 
YHLNLTVGGI PFK^ELVQPINPRIJXK31RSWNWI^GEDSAI©ETra 
RGSFFPGNGFATYRIiNYTRTSLD VUT tfriV TEVRVVAR irpatdtgvllalvgdddvvi s 
VALVDYH ST K X LKKQLVVLAVEDVAIALME I KVCDSQ EHTVTVS LREGEATLEVDGTKG 
QSEVSl^I^ERUnijKTHLQGSVHTYVGGI^EVSVISAP 
LDTASYKHSDITSHSCPPVEHATP 



95 / 133 



(0 2003 Copyright Derwent Information Ltd. 



WO 00/53758 



PCT/US00/05841 



Figure 93 

AGTCGACTGCGTCCCCTGTACCCGGCGCCAGC^^ 

GCTGCACCGGGCCTGGCAGCGCTCCGC^^ 

TGGCOTCTGGGCXTCGCGGGGGGATTCTTGGCAGTT^^ 

GAGGGGAAGGGAGGGGGAACCGGGTTGGGGAAGCCAGCTGTAGAGGGCGGTGACCGCGCT 

C^GACACAGCTCTGCGTCCTCGAGCGGGACAGATCCAAGTTGGGAGCAGCT 

GGGGCCTCAGAGA 

ATWGGCCGGCGTTCGCCCTGTGCCTCCTCTGGCAGGCGCTCTGGCCC 

GGCGAACACCCCACTGCCGACCGTGCTGGCTGCTCX3GCCTCGGGG 

CACCACGCTACCATGAAGCGGCAGGCGGCCGAGGAGGCCTX3CATCCTG 

CTCAGCACCGTGCGTGCGGGCGCCGAGCTGCGCGCTGTGCTCGCGCTCCTGCGGGCAGGC 

CCAGGGCCCGGAGGGGGCTCCAAAGACCTGCTGTTCTGGGTCGCACTGGAGCGCAGGCCT 

TCCCACTGCACCCTGGAGAACGAGCCTTTGCGGGGTTTCT^ 

CGGAGATGCGCGGTACTCCAGGCCACCGGTGGGGTCGAGCCCGCAGGCTGGAAGGAGATG 

CGATGCCACCTGCGCGCCAACGGCTACCTGTGCAAGTACCAGTTTGAGGTCTTG^ 

GCGCCGCGCCCCGGGGCCGCCTCTAACTTGAGCTATCGCGCGCCCTTCCAGCTGCACAGC 

GCCGCTCTGGACTTCAGTCCACCTGGGACCGAGGTGAGTGCGCTCrGCCGGG 

CCGATCTCAGTTACTTGCATCGCGGACGAAATCGGCGCTCGCTGGGACAAACTCTCGGGC 

GATGTGTTGTGTCCCTGCCCCGGGAGGTACCTCCGTGCTGGCAAATGCGCAGAGCTCCCT 

AACTGCCTAGACGACTTGGGAGGCTTTGCC^ 

AAGGACGGCCGCTCTTGTGTGACCAGTGGGGAAGGACAGCCGACCCTTGGGGGGACCGGG 

GTGCCCACCAGGCGCCCGCCGGCCACTGCAACCAGCCCCGTGCCGCAGAGAACATGGCCA 

ATCAGGGTCGACGAGAAGCTGGGAGAGACACCACTTGTCCCTGAACAAGAC^^ 

ACATCTATTCCTGAGATTCCTCGATGGGGATCACAGAGCZACGATGTCTACCOT 

TCCCTTCAAGCCGAGTCAAAGGCCA^ 

AA TTCTA CGACTTCCTCTGCCACTCCTCAGGCTTTCGACTCCTCCTCT 
ATATTTGTGAGCACAGCAGTAGTAGTCTTGGT^ 

GTCAAGCTCTGCTTTCACGAAAGCCCCTCTTCCCAGCCAAGGAAGGAGTCTATGGGCCCG 

CCGGGCCTGGAGAGTGATCCTGAGCCCGCTGCTTTGGGCTCCAGTTCTGCACATTG 

AACAATGGGGTGAAAGTCGGGGACTGTGATCTGCGGGACAGAGCAGAGGGTGCCTTGCTG 

GCGGAG TCCCC TC TTGGC TCTAGTGATGCATAGGGAAACAGGGGACAT^ 

GAACAGTTTTTCACTTTTGATGAAACGGGGAACCAAGAGGAA 

C1TACTTGTGTAACTGACAATTTCTGCAGAAATCCCCCTTCCTCT 

CAC^GGAGCTAAATCAGAACTGCACACTCCTTCCCTGATG^ 

CCTTTAGGATGGTGATACTGGGGGACCGGGTAGT^^ 

TATTCGGAGAATTTGGAGAAGTGATTGAACTTTTCAAGACATTGGAAAC^ 

AATATAATTTACATTAAAAAATAATTTCTACCAAAATGGAAAGGAAATC 

TCAGGCTAGGAGTATATTGGTTCGAAATCCCAGGGAAAAAAATAAAAATAAAAAATTAAA 
GGATTGTTGAT 
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Figure 94 

MRPAFALCIXWQALWPGP(^EHPTADRAGCSAS(^ 

LSTVRAG&SLSAVLJilAJiliGPGPG 

GGLESIHTjQWVEEPQRSCTARRCAVLQATGGVEPAGW^ 

APRPGAASNI^YRAPFQLHSAALDFS P PGTEVSALCRGQLP I SVTC IADE IGARWDKLSG 
DVIXZPCPGRYI»RAGKCAELPNCI^DLGGFACECATGFEIX3 

VPTRRPPATATSPVPQRTWPIRVDEKLGETPLVPEQDNSVTSI PEI PRWGSQSTMSTLQM 
SLQAESKATITPSGSVI S KFNSTTSS ATPQAFDSS SAWFIFVSTAWVLVI LTMTVLGL 
VKLCFHESPSSQPRKESMGPPGLESDPE PAALGSSSAHCTI^GVICVGDCDLRDRAEGALL 
AESPLGSSDA 
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Figure 95 

GA.CTAGTTCTCTTGGAGTCTGGGAGGAGGAAAGCGGAGCCGGCAGGGJVGCG 
TGGGGTGACGGCAGGGCAGGGGGCGCCTGGCCGGGGAGAAGOGCG 
CCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGG^ 
GGGACTGCGAGAGGACCCCGGCGTCCGGGCTCCCGGTGCCAGCGCT 

AT^GGCCACTCCTCGTCCTGCTGGTCCTGGGCCTGGCGGCCGGCTCGCCCCCACTG 
C^C^CAAGATCCCCAGCCTCTGCrc^ 

CATGGCAGCCAGGGCTTGCCGGGCCGCGATGGCCGCGACGGCCGCGACGGCGCGCCCGGG 
GCTCCGGGAGAGAAAGGCGAGGGCGGGAGGCCGGGACTGCCGGGACCTCGAGGGGACCCC 
GGGCCGCGAGGAGAGGCGGGACCCGCGGGGCCCACCGGGCCTGCCGGGGAGTGCTCGGTG 
CCTCCGCGATCCGCCTTCAGCGCCAAGCGCTCCGAGAGCCGGGTGCCTCCGCCGTCTGAC 
GCACCCTTGCCCTTCGACCGCGTGCTGGTGAACGAGCAGG 

GGCAAG1TCACCTGCCAGGTGCCTGGGGTCTACTACTTCGCCGTCCATGCCA 

TTTTTCGGGGGGTGGCCCAAGCCAGCCTCGCTCTCGGGGGGGGCCATGGTGAGGCT 
CCTGAGGACCAAGTGTGGGTGCAGGTX3GGTGTGGGTGACTACATTGGCAT 

GTCTTTGCTTAOTGCCCACTGCAAAGTGAGCTCATGCTCTCACTCCTAGAAGGA 

GAGGCTGACAACCAGGTCATCCAGGAGGGCTGGCCCCCCTGGAATATTGTGAATGACTAG 

GGAGGTGGGGTAGAGCACTCTCCGTCCTGCTGCTGGCAAGGAATGGGAACAGTGGCTCT 

TGCGATCAGGTCTGGCAGCATGGGGCAGTGGCTGGATTTCTGCCCAAGACCAGAGGAGTG 
TGCTGTGCTGGCAAGTGTAAGTCCCC»^ 

CTCTCTOCTGGTCCTCTGCTTCrCTGGA 

GCCCTTTTCTCAGAGATCACTCAATAAACCTAAGAACCCTC^ 
AAAAAAAAAAA 
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Figure 96 

MRPLLVLLLLGLAAGS PPLDDNKI PSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPG 
APGEKGEG6RPGLPGPRGDFGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSD 
APLPFDRVLVNEQGHYDAVTGKFTCXJVPGVYYFAVHATVYR^ 

FFGGWPKPASLSGGAMVRLEPEDQVWVQVGVGDYI GI YAS IKTDSTFSGFLVYSDWHSS? 
VFA 
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Figure 97 

TG 

CCCCTGCTGCTGCTGCCCCTGCTGTGGGGGGGGTCCCTCCAGGAGAAGCCAGTGTACGAG 

CTGCAAGTGCAGAAGTCGGTGACGGTGCAGGAGGGCCTGTGC 

TTCTCTTACCCCTCGAGATCCTGGTATTCCTCTCCCCCACTCTACGTCT 

GACGGGGLAGATCCCATACTACGCTGAGGTTGTGGCCACAAACAACCCAGLACAGAAGAGTC 

AAGCCAGAGACCCAGGGCCGATTCCGCCTCCTTGGGGATGTCCAGAAGAAGAACTGCTCC 

CTGAGCATCGGAGATGCCAGAATGGAGGACACGGGAAGCTA TT TCTTCCGCGTGGAGAGA 

GGAAGGGATGTAAAATATAGCTACCAACAGAATAAGCTGAA 

ATAGAGAAACCCGACATCCACTTTCTGGAGCCTCTGGAGTCCGGCCGCCCCACAAGGCTG 
AGCTGCAGCCTTCCAGGATCCTGTGAAGCGGGACCACCTCTCACATTCT 
AATGGCCTCAGCCCCCTGGACCCTGAGACCACCCGCTCCTCGGAGCTCACCCTCACCCCC 
AGGCC CGAGGACCATGGCACCAACCTCACCTGTCAGGTGAAACG CCAAGGAGCTCAGGTG 
ACCACGGAGAGAACTGTCCAGCTCAATGTCTCCTATGCTCCACAGAACCTCGCCATCAGC 
ATCTTCTTCAGAAATGGCACAGGCACAG CCCTG CGGATCCTGAG CAATGG CATGTCGGTG 
CCCATCCAGGAGGGCCAGTCCCTGTTCCTCGCCTGCACAGTTGACAGCAACCCCCCTGCC 
TCACTGAGCTGGTTCCGGGAGGGAAAAGCCCTCAATCCTTCCCAGACCTCAATGTCTGGG 
ACCCTGGAGCTGCCTAACATAGGAGCTAGAGAGGGAGGGGAATTCACCTGCCGGGTTCAG 
CATCCGCTGGGCTCCCAGCACCTGTCCTTCATCCTTTCTGTGCAGAGAAGCTCCTCTTCC 
TGCATATGTGTAACTGAGAAACAGCAGGGTTCCTGGCCCCTCGTCCTCACCCTGATCAGG 
GGGGCTCTCATGGGGGCTGGCTTCCTCCTCACCTATGGCCTCACCTGGATCTACTATACC 
AGGTGTGGAGGCCCCCAGCAGAGCAGGGCTGAGAGGCCTGGCTGAGCCCCTCCCGCTCAA 
GACAG AATTGAG GTG TG GA CACTTAGCC CTGTGGGACACATG CAGGACAT CACTGTCAGC 
TTCTTTCTGGAAGCTCACATCCCACTGACTACCCCTCTTTTCCTTCCT 
TTCTACTTATTCCCCTCTGCTTGTGAGTCTTGCCCCANCACACCTGCATCCCCATCTGCA 
NCCCATCCCCTCTCCANCTGCCCTTCTCTTCCCTCTCCATCCAN 
AAGGG AATGTACITTCGGT CCTATAC CCCATTAC CATTAC CAAAAGTTAC 
TTTTT 
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Figure 98 

PUOXPLLWGGSWEIO>VYELQVQKSVTVQEGLCVLV^ 

DOT I P YYAEWATNNP DRRVKP ETTQGR FRLLGD VQKKNCS LS I GDARMEDTGSYFFR VER 
GRDVKYSYQQNKIiNLEVTALIEKPDIHFXEPLESGRPTRLSCSLPGSCEAGPPLTFSWTG 
NALS PLOP ETTRS S ELTLTPRP EDHGTNLTCQVKRQGAQVTTERTVQIJ^S YAPQNLAI S 
I FFRNGTGTALRILSNGMS VP IQEGQS LFLACTVDSNP PASLSWFREGKALNPSQTSMSG 
TLELPNIGAREGGEFTCRVQKPLGSQHI^FILSVQRSSSSCICVTEKQQGSWPLVLTLIR 
GALMGAGFLLTYGLTW I YYTRCGG PQQS RAERPG 
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Figure 99 

TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCCT 
TCCTGAACTTGTCTGAAGCCCTTGTCCGTAAGCCTTGAACTACGTT 
CTCGAGGGACCTTTCGC^ 

TTGATCCCTGAACCAGAAGTGAAAAT^^ 

AAGACC31AAGGAGGGGATTTGATGTTGGTCCACTATGAA 

TCCTTATTTCACTCCACTCACAAACATAACAATGGTCAGCCCATl^ 

ATCCTGGAGGCTCTCAAAGGTTGGGACC^GGGCTTGAAAGGAATGTC 

AGAAAGCTCATCATTCCTCCTGCTCTGGGCTATGGAAAAGAAGGAAAAGCT 

CCAGAAAGTACACTGATATTTAATATTGATCTCCTGGAGATTCGAAATGGAC 

CATGAATCATTCCAAGAAATGGATCTTAATGATGACTGGAAACTCTCTAAAGATGATC 
^C^ATA^^ 

GATGCTTTGGTGGAGGATATTTTTGATAWVGAAGATGAAGAC^ 

GCCAGAGAATOACATATAAACACGATGAGTTATAGAGATACATCTACCCTTT^ 

G CACT CATCTTTCAAGAGAGGGCAG TCATCTTTAAAGAACATTTTATTTTTATACAATGT 

TV i ri LTTGi. 1 T i u i xi i-i-iATTTTTATATATTTTTTCTGACTCCTATTTAAAGAACC CC 

TTAGG TTTCTAAGTrACCCATTTCTTTCTX^ 

CTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTCAC^GATATG 
TTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACCAC^ 
TATAGCACAAATTAG CACCCTATATTTCTG CTTC CCTCTATTTTCTCCAAGTTAGAGG TC 
AACATTTGAAAAGCCTTTTGCAATAGC C CAAGGCTTG CTATTTTCATGTTATAATGAAAT 
AGTTTATGTGTAACTGGCTCTGACTCTCTGCTTGAG^ 

CCTGACTTGTTAATGGCTACTGCITTACrAAGGAGATGTGCAATGCT 
AGGTTAATAGCCAGGCATGGT 

TCCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGOT 

GGAGTTCGAGACCAGCCTGACCAACACGGAGAAACCCTATCTCTACTAA^ 

AGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCTACCCAGGAAGGCTGAGGCGGCAGA 

ATCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAGATCACCTNCAGCCTGGACACTC 

TCTXTTCGAAAAAAGAA^GAACACGGTTAATACCATATOAATATGTATG CATTGAGACAT 

GCTACCTAGGACTTAAGCTGATGAAGCTTGGCTCCTAGTGATTGGTGGC 

AATAGGACAAATCATCTATGTGTGAGTTTCTTTC 

ATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTA^ 

TCCTGv, ixu 1 1AA.TGGGTAGTCTATAGTATATTATACTACAATAACATTGTATCATAAG 
ATAAAGTAGTAA^CCAGTCTACATTTTCCCATTTCTGTCTCATCAAAAACT 
TGGGTGTGGTGGCTCATGC CTGTAATCCCAGCACTTTGGGGG CCAAGGAGGGTGGATCAC 



ATACAAAAATTAGKTCAGGCGTCGTGGTGCACACCTGTAGTCCCAGCTACT 
AGACAGGAGATTTGOTGAACCCGGGAGGCGGAGGTO 

J 2?^" ^ CCTGGGTGACAGAG CAAGACT CCATCTCAAAAAAAAAAAAAAGAAG CAGA 
CCTACAGCAGCTACTATTGAATAAATACCTATCCTGGATTTT 
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Figure 100 

MRLFXWHAVLTLFVTSLIGALI PEPE^ra 

SLFHSTHKHNNGQP I WFTLGI LEALKGWDQGLKGMCVGEKRKLI I PPALG YGKEGKGKI P 

PESTI*IF7JIDLLEIRNGPRSHESFt3EMDLND^ 

DALVEDI FDKEDEDKDGFI SAREFTYKHDEL 
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Figure 101 



GAATTCCGGGCCCCAGGATGCCAACTTTGAATAGG 
ATGAAGACTAC 

AACTOOTCCCTTCTCATCTG^ 

TGAGACCATAGAGATTATCGTGGAGAATAAGGTCAAjCK^ cagctaa 
CTATCCCTCCACTGTAACGAAGACTCTCTCra 

GGCCTCCTGCCCTGCTGGGATGACTGCTACTGGGTGTGCTTGTGGCTTTGCCTGTGGATC 



^CCCGCTGCTGCCAACTGTCCE^GAATGAAGAGGTGGAGAACC 
GAATCTAACAAAAACTGCAGTCTCAATTTGGAAATCTGACTCATGTGC 
TCATATTGCC CATTTACC CTGCTTCTTGAAATGCTTCTTGAAAAATAAAGACAAATTTG C 




ATGTG • 
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Figure 102 
MKTTTCSLLICISIJ<QLMVPVNTDETIEIIV^ 

TMNRWAS CPAGMTATGCACGFACGSWEI QS GDTCNCLCLLVDWTTARCCQLS 
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Figure 103 

GCCAGGGGAAGAGGGTGATCCGACCCGGGGAAGGTCGCTGGGCAGGGCGAGTTGGGAAAG 
CGGCAGCCCCCGCCGCCCCCGCAGCCCCTTCTCCTC 

CTCGCTGGAGGCCAGGCCGTGCAGCATCGAAGACAGGAGGAACTGGAGCCI^ATTGGCCG 
GCCCGGGGCGCCGGCCTCGGGCTTAAATAGGAGCTCCGGGCTCTGGCTGGGACCCGACCG 
CTGCCGGCCGCGCTCCCGCTGCTCCTGCCGGGTG 

ATGGAAAACCCCAGCCCGGCCGCCGCCCTGGGCAAGGCCCTCTGCGCrCT^ 

ACTCTCGGCGCCGCCGGCCAGCCTCTTGGGGGAGAGTCCATCTGTTCCGCCAGAGCCCCG 

GCCAAATACAGCATCACCTTCACGGGCAAGTGGAGCCAGACGGCCTTCCCCAAGCAGTAC 

CCCCTGTTCCGCCCCCCTGCGCAGTGGTCTTCGCTGCTGGGGGCCGCGCATAGCTCCGAC 

TACAGCATGTGGAGGAAGAACCAGTACGTCAGTAACGGGCTGCGCGACTTTGCGGAGCGC 

GGCGAGGC CTGGGC GCTGATGAAGGAGATCGAGGCGGCGGGGGAGGCGCTGCAGAGCGTG 

CACGAGGTGTITTCGGCGCCCGCCGTCCCCAGCGGCACCGGGCAGACGTCGGCGGAGCTG 

GAGGTGCAGCGCAGGCACTCGCTGGTCTCGTTTGTGGTGCGCATCGTGCCCAGCCCCGAC 

TGGTTCGTGGGCGTGGACAGCCTGGACCTGTGCGACGGGGACCGTTGGCGGGAACAGGCG 

GCGCTGGACCTGTACCCCTACGACGCCGGGACGGACAGCGGCTTCACCTTCTCCTCCCCC 

AACTTCGCCACCATCCCGCAGGACACGGTGACCGAGATAACGTCCTCCTCTCCCAGCCAC 

CCGGCCAACTCCTTCTACTACCCGCGGCTGAAGGCCCTGCCTCCCATCGCCAGGGTGACA 

CTGCTGCGGCTGC^^GAGCCCCAGGGCCTTCATCCCTCCCGCCCCAGTCCTGCCCAGC 

AGGGACAATGAGATTGTAGACAGCGCCTCAGTTCCAGAAACGCCGCTGGACTGCGAGGTC 

TCCCTGTGGTCGTCCTGGGGACTGTGCGGAGGCCACTGTGGGAGGCTCGGGACCAAGAGC 

AGGACTCGCTACGTCCGGGTCCAGCCCGCCAACAACGGGAGCCCCTGCCCCGAGCTCGAA 

GAAGAGGCTGAGTGCGTCCCTGATAACTGCGTCTAAGACCAGAGCCCCGCAGCCCCTGGG 

GCCCCCCGGAGCCATGGGGTGTCGGGGGCTCCTGTGCAGGCTCATGCTGCAGGCGGCCGA 

GGGCACAGGGGGTTTCGCGCTGCTCCTGACCGCGGTGAGGCCGCGCCGACCATCTCTGCA 

CTGAAGGGCCCTCTGGTGGCCGGCACGGGCATTGGGAAACAGCCTCCTCCTTTCCCAACC 

TTGCTTCTTAGGGGCCCCCGTGTCCCGTCTGCTCTCAGCCTCCTCCTCCTGCAGGATAAA 

GTCATCCC CAAGGCTCCAGCTACTCTAAATTATGTCTC CTTATAAGTTATTGCTG CTCCA 

GGAGATTGTCCTTCATCGTCCAGGGGCCTGGCTCCCACGTGGTTGCAGATACCTCAGACC 

TGGTGCTCTAGGCTGTGCTGAGCCCACTCTCCCGAGGGCGCATCCAAGCGGGGGCCACTT 

GAGAAGTGAATAAATGGGGCGGTTTCGGAAGCGTCAGTGTTTCCATGTTATGGATCT 

TGCGTTTGAATAAAGACTATCTCTGTTGCTCACAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAA 
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Figure 104 

MENPS PAAALGKALCALLLATLGAAGOPLGGES I CSAKAPAKY5ITFTGKWSQTAPPRQY 
PLFRPPAQWSSLIXLAAHSSDYSMWRKN^^ 

HEVFSAPAVPSGTGQTSAELEVQRRHSLVSFVTOIVPSPDWFVGTO 
AIJ)LypyDAGTDSGFTFSSPNFATIPQDTVTEITSSSPSHPANSFYYPRIiKALPPIARVT 
LUUJlQSPRAFIPPAPVLPSRDNEIVDSASVPETPLDCE^SLWSSWGLCGGHCGRIXnT^ 
RTRYVRVQPANNGSPCPELEEEAECVPDNCV 
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Figure 105 

GCCTAGCCAGGCCAAGA 

ATOCAATTGCCCCGGTGGTGGGAGCTGGGAGACCCCTGTGCTTGGACGGGACAGGGTCGG 
GGGACACGCAGG 

ATGAGCCCCGCGACCACTGGCACATTCTTCCTGACAGTGTACAG 

CACTC CGAT CGGAATGTATACCCATCAGCAGGTGTCCTCm^ 

GAATATTTTAAGGGGGAATTTCCACCTTACCCAAAAC 

ATAACATTTAATACAAATTTAATGGGTTACCCAGACCGACCTGGATGGCTTCGATAXATC 

CAAAGGACACCATATAGTGATGGAGTCCTATATGGGTCCCCAACAGCTGAAAATGTGGGG 

AAGCCAACAATCATTGAGATAACTGCCTACAACAGGCGCACCTTTGAGACTGC^ 

AATTTGATAATTAATATAATGTCTGCAGAAGACTTCCCGTTGCCATATCAA 

TTCATTAAGAATATGAATGTAGAAGAAATGTTGGC^ 

GGCGCAGTTGAAAAATGTGTGGCAGCCAGAGCGCCTGAACGCCATAAACATCACATCGGCC 
CTAGACAGGGGTGGCAGGGTGCCACTTCCCATTAATGACCTGAAGGAGGGCGTTTATGTC 
ATGGTTGGTGCAGATGTCCCGT1 1 1L1 iLlllI 1 TA CGAGAAGTTGAAAATCCACAGAAT 

CAATTTTACATTGACTGGTGCAAAATTTCATTGGCT^ 

TATCAGGAAGTGATTCGTGGAGAGGGGATTTTACCTGATGGTGGAGAATACAAACCCCCT 
TCTGATTCTTTGAAAAGCAGAGACTATTACACGGATTTCCTAATTACACTGGCTGTGCCC 
TCGGCAGTGGCACTGGTCCTTTTTCTAATACTTGCTTATATCATGTGCTGCCGACGGGM 
GG CGTGG AAAAGAGAAACATGCAAACACCAGACATCCAACTGGTCCATCACAGTGCTATT 
CAGAAATCTACCAAGGAGCTTCGAGACATGTCCAAGAATAGAGAGATAGCATGGCCCCTG 
TCAACGCTTCCTGTGTTCCACCCTGTGACTGGGGAAATCATACCTCCTTTACACACAGAC 
AACTATGATAG GACAAACATGCCATTGATGCAAACGCAG CAGAACTTGCCACATCAGACT 
CAGATTC CCCAACAGCAGACTACAGGTAAATGGTAT CCCTGAA G AAAGAAAACTGA CTGA 
AGCAATGAATTTA^ 

AATGCATGAGCTTTTCTGGCATATGTTATGCATGTTGGCAGTATTAACT 
AATACAACATAACTTTCATTTTACTAATGTATTTTTTTG TACTTAAAGCATTTTTGACAA 
TTTGTAAAACATTGATGACTTTATATTTGTTACAATAAAAOT 
TTATTAATGAAGCCTAAAAAAAAAAA 
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Figure 106 

MQL PRWWELGDPCAWTGQGRGTRRMS PATTGTFLLTVYS I FSKVHSDRNVYPSAGVLFVH 
VLEREYFXGEFPPYPKPGEI SNDP ITFNTNLMGYPDRPGWLRYI QRTP YS D6VLYGSPTA 
ENVGKPTI I EI TAYNRRTFETARHNLI INIMSAEDFPLPYQAEFTIKNMNVEEMLASEVL 
GDFT^VKNVWQPERI^INITSAI^RGGRVPLPINDLKEGV^^ 

NPQNQLRGSQ EMEP VI TCDKKFRTQ FYI DWCKI S LVDKTKQVSTYQEV I RGEG ILPDGGE 
YKP PSDSLKSRDYYTDFLI TLAVPSA VAL VLFLI LAYIMCCRREGVEKRNMQTPD I QLVH 
HSAIQKSTKELRDMSKNREIAWPLSTLPVFHPVTGEI I PPLHTDNYDSTNMPLMQTQQNL 
PHQTQI PQQQTTGJCWYP 
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Figure 107 

ATTCTCCTAGAGCATCTTTGGAAGC 
ATCAGGCCACGATGCTGCATCTITGGCTCTTCTCTC 

TGTTCAAAAGGAACTACAGACGCTCCTGTTGGCTCAGGACTGTGGCTGTGCCAGCCGACA 

CCCAGGTGTGGGAACAAGATCTACAACCCTTCAGAGCAGTGCTGTTATGWTGATGCCATC 

TTATCCTTAAAGGAGACCCGCCGCTGTGGCTCCACCTGCACCTTCTGGCCCTGCT 

CTCTGCTGTCCCGAGTCTTTTGGCCCCCAGCAGAAGTTTCTTGT^^ 

GGTATGAAGTCTCAGTGTCACTTATCTCCCATCTCCCGGAGCTGTACCAGGAACAGGAGG 

CACGTCCTCTACCCATOAAAACCCCAGGCTCCACTGGCAGACGGCAGACAAGGGGAGAAG 

AGACGAAGCAGCTGGACATCGGAGACTACAGTTGAACTTCGGAGAGAAGCAACTTGACTT 

CAGAGGGATGGCTCAATGACATAGCTTTGGAGAGGAGCCCAGCTGGGGATGGCC^ 

C^GGAAGAATGCCTTCCTGCTTCATCCCCTTTCCAGCTCCCOT 

CTTTCATCGGCAATAAAATCCCCCACATTTACCATCT 
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Figure 108 

MRPRCCILALVCWI TVTI^CSKGTTDAPVGSGLWLCQPTPRCGNKIYNPSEOCCYDDAI 

I^IJCETRRCGSTCTFWPCFELCCPESFGPQQKFLVKLRVLGMK£(XHLSPISR 

HVLYP 
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Figure 109 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCrc 
GACT 

ATTCCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCT^ 

CTGATGGCCGCCTG CTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTG 

ACCAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAG 

GAGCTGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAG 

CCAGGGCAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGA 

GAGGCAAAACTCCAATATGAC^ACAAGTTCCGAAATAATTTGAAAGGCAA 

ATCAACACCAACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAG 

GGGGCAGAGATGGAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTC 

TTCCGCCCCATTGAGGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACT 

GACATGCAGATCATGGTACGGCTT^TCAACAAGTTCAATAGTTCCAGCTC CAGTTTGGAA 

GAGAAG ATTG C TGCG CTCTTTGATCTTGAATATTATGTCCATCAGATG^ 

GACCTGCTTTCCTTTGGTGGTCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCC 

CTCGTGAAGGAGTATGCTGCGTTTGTGCTGGGCGCTGCCTTTTCCAGCAACCC CAAGGTC 

CAGGTGGAGGCCATCGAAGGGGGAGCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAG 

CAGCCGCTCACTGCAAAGAAGAAGGTCCTGTTTGCACTGTGCTCCCTGCTGCGCC^CTTC { 
CCCTATGCCCAGCGGCAGTTCCTGAAGCTCGGGGGGCTGCAGGTCCTGAGGACCCTGGTG 
CAGGAGAAGGGCACGGAGGTGCTCGCCGTGCGCGTGGTCACACTGCTCTACGACCTGGTC 
AC GG AGAAGATGTT CG C CGAGGA GG AG G CTG AG CTGA CC CAG GAG ATGT CC C C AG AG AAG 

CTGCAGCAGTATCGCCAGGTACACCTCCTGCCAGGCCTGTGGGAACAGGGCTGGTGCGAG 

ATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGATGCCCGTGAGAAGGTGCTGCAGACA 

CTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCGTCAGGACCCCCAGCTCGGCAGG 

ACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCAGCCTGGAGCTGCAGGATGGT 

GAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAACAGCTTGCT 

AGATG^GCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGAGGCTGAGGGGTGCCAG 

CGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCTTGGCCATTAAA 

AAACCTGAAGGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 110 

MAPQSLPS SRMAPI/SMLLGLLMAACFTFCLSHQNLKEFALTNPEKSS 1 KE 1 LKKLTKA EE 
ELDAEVLEVTHPTHEWQAI^PGCAVPAGSHVRLN^ 
IimTTYTSQDIJCSALAKFKEGAEMESSKEDKAROAEVKRLr^ 
DMQIMVRLINKFNSSSSSLEEKIAALFDLEYYVHQMDN^ 

LVKEYAAFVLGAAFSSNPKVQVEAI EGGALQKLLVT LATEQPL.T7UCKKVLFALCSLLRHF 
PYAQRQFXKLGGI^VU*TLVQEKGTEVIAVRVV^ 
LQQYRQVHIiLPGLWEQGWCEITAHLXJUjPEHDAREKVLQTLGV^ 
TLASLQAEYQVLASLELQDGEDEGYFQELLGSVNSLLKELR 
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Figure 111 

AA^TATCATCTATTTATCATTAATCAATAATGTATT CTTTTATTC CAATAACATTTGGG 

TTTTGGGATTTTAATTTTCAAACACAGCAGA 

A^CATTTTTTCTCTCACTATTATrATTGTTGGTATCT 

TCAGGAAGCAACACATTGGAGAATGGCTACTTTCTATQ\AGAAATAAAGA 

CAACCCACACAATCATCTTTAGAAGACAGTGTGACTCCTACCAAAGCTGTCAAAACCA 

GGCAAGGGCATAGTTAAAGGACGGAATCTTGACTC^AGAGGGTTAAT^ 

QOCTGGGGCAGGGGT^^ 

- TACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATATATACAAXAAGCT 
TTATAAGTACCCTATGCAGTTGGCTGGAC 

AGTTCTAAATTGGACTTTATTAATTTTTAAAATCAGTAACT 

GCTTAGATCTA CAGGAGATCATATAATTTGATA CAAATAAAAGAAAAGTGTTCTCTCCCC 
TTACAGAATTGACATTTTAAATGCGATACAGTTAGAATAGGAAATATGACATra 
AAGAATGACAGGGAGAAAGGAAAGAAGGG AAAATGTTG CCAAGGAAAAAAAAA 
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Figure 112 

MTFFLSIXLLLVCEAIWRSNSGSNTLENCrre 
GKGIVKGRNLDSRGLILGAEAWGRGVKKNT 
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Figure 113 

TGAA.GGACTTTTCCAGGACCCAAGG 
CTCCTTGGCCTCCGCAGCCGATCAC 

A^aAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTCSGCACAGGTGTGG CTG 
GTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAC^GACCCCAGCCCCTCAGAACCAGArc 
AGCAGGGTAGTGCA.GGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGGCCAGCGAGGAG 
AAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAG CAGGCAGCAG CTTG CCAAGGAG 

ACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGGCAACATG 
GTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATC 

CCGACTGAAACCCAGAT CAAG AGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAGCCC 
GGGCTCCTGCCTTCCC TCTTT AAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACTG 
GGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATC 

TTCAATTXATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTC 
TCACAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAA^ 
AAACTGTTTGATGAGATTAATCCTGAAAC CAAATTAATTCTTGTGGATTACATCTTGTTC 

GACAAGTACAAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACC 
TTTGACAAGAATTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCC^ 
CTGGTGGTCCTCATGGAGAAAATGGGTGACCACCTCGCCCTTGAAGACTACCTGACCACA 
GACTTGGTGGAGACATGGCTCAGAAACATGAAAACCAGAAACATGGAAG1 1 1 1'LTT T CCG 
AAGTTCAAGCTAGATCAGAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGA 
AGAATCTTCTCACCCTTTG CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAA 
GTATCCAGGGTTTTACGAAGAACAGTGATTGAAGTTGATGAAAGGGGCACTGAOT 
GCAGGAATCTTGTCAGAAATTACTGCTTATTCCATGCCTCCTGT^ 



GTGAATCCGACTCTCCTAT^TTCAGGACATGCATAAGCACTTCGTGCTGTAGTAGATGC 
TGAATCTGAGGTATCAAA» 

CTTTTGTTCTTAACTAGTTT 

AGGGTGTTCTCAAATAAATACAGTAGTC^ 

CCCCAGCAGATGCCTGAAACGGTGGACAGTGCTC^CCTTA 

ATACATACCTATGATAAAGTTTAATTTATAAATTAGGCACACTAAGA 

AACAACATTAAGTAAAATGAGTTACTTGAACGCAAGCACTGC^TAC 

CTCATTATAGAGAAGGCTACTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGA^ 

ATTGGGCAAGGGGAGAATTCACATCCTGGGTGGGAQ^GAGCAGGACG^ 

TCC CACTACTCAGAATGG CATGCTG CTTAAGACTTTTAGATTGTTTATTTCTGGAATTTT 

TaTTTAATGTTTTTGGACCATGGTTGACCATGGTTAACT 

»T^TAAGTCAGGACTACTACAAAAGCATTA^ 
AAAAAAAAAA 
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Figure 114 

MKVVPSLLLSVLIAQVWLVPGtAPS PQS PETPAPQNQTSR WQAPREEEEDEQEASEEKA 

GEEEKAWLMASRQQLAKETSNFGFSLIJ^ISMRHD^ 

ETQIKRGLHLQALKPTKPGIXPSI^KGIJIETI^RNLELGLS^ 

LSKRYFDTECVPMNFRNASQAK^LMNHYINKETRGKI PKLFDEINPETKLILVDYILFKG 
KWLTPFDPVFTEVDTFHLDKY ECTI KVPMMYGAGKF ASTFDKNFRCHVL KL P YQGNATMLV 
VI^EKMGDHLALEDYLTTDLVETWLRNM^ 

FS PFADI^ELSATGRNLQVSRVLRRTVI EVDERGTEAVAGII^ EITAYSMPPVI JCVDRPF 
HFMI YEETSGMLLFLGRWNPTLL 
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Figure 115 

CGGCAACCAGCCGCCGCCACCACCGCTGCCACTGCCGCCCTGCCGGGGCC 



ATGTTCGCTCTGGGCTTGCCCTTCTTGGTGCT 

GGGGTTCTGGGGCCCAAGAACGTCTCGCAGAAAGACGCCGAGTTTGAGCGCACCTACGT 
GACGAGGTCAACAGCGAGCTGGTCAA.CATCTACACCTTCAACCATACT 




GGGATGTTTCAGCGCAAGTACCTCTACC^AAAAGTGGAACGAACCCTGTGTCAGCCCC 

ACCAAGAATGAGTCGGAGATTCAGTTCTTCTACGTGGATGTGTCCACCCTGTCACCAGTC 

AACACCACATACCAGCTCCX5GGTCAGCCGCATGGACGATTTTGTGCTCAGGA 

CAGTTCAGCTTCAATACCACAGCAGCACAGCCCCAGTACTTCAAGT^ 

GGCGTGGACTCGGTAATTGTCAAGGTGACCrrCCAACAAGGCCT 

TCCATTCAGGATGTGCTGTGTCCTGTCTATGACCTGG^ 

ATGTACCAGACGATGACCAAGAAGGCGGCCATCACCGTACAGCGCAAAGACTTCCCCAGC 
AACAGCTTTTATGTGGTGGTGGTGGTGAAGACCGAAGAC 
CCTTTCTACCCCTITCGCAGAAGATGAACCGGTCGATCAAGGGC^ 
TCAGTGCTGGTGTCTCAAGCAGTCACGTCTGAGGC 

ATACGTCAGTGGGATGCTCTTTTGCCTGGGTATATTTCTCTCCTTTTACCTGCTGACCGT 

CCTCCTGGCCTGCTGGGAGAACTGGAGGCAGAAGAAGAAGACCCTGCTGGTGGCCATTGA * 

CCGAGCCTGCCCAGAAAGCGGTCACCCTCGAGTCCTGGCTGATTCTTTTCCTG 

CCCTTATGAGGGTTACAACTATGGCTCCTTTGAGAATGTTTCT 

GGTTGACAGCGCTGGCACTGGGGACCTCTCrTACGGTTACCAGGGCC 

TGTAGGTACTCGGCCCCGAGTGGACTCCATGAGCTCTGTGGAGGAGGATGACTACGACAC 

ATTGACCGACATCGAITCCGACAAGAATGTCATTCGCACCAAGCAATACCT 

TGACCTGGCACGGAAGGACAAGCGTGTTCTGCGGAAAAAGTACCAGATC 

CATTGCCACCATTGCTGTCTTCrATGCCCTT 

GACGGTGGTGAATGTC^CAGGGAATCAGGACATCTGCTACT 

CCCACTGGGCAATCTCAGCGCCrTCAACAACATCCTCAGCAACCTO 

GGGGCTGCTTTCCCTGCTCATCATCCTGCAACGG 

GCGCAATGACCTCTGTGCCCTGGAATGTGGGATCCCCAAACACTTTGTC 
CATGGGCACAGCCCTGATGATGX^GGGGCTGCTC^ 
CTATACCAATTTCCAGTTTGACACATCGTTCATGTACATGATOT 
GAAGCTCTACCAGAAGCGGCACCCGGACATCAACGCCAGCGCCT^ 

CAAAGGGAACACGGCGTTCTGGATCGTCTOCT 

CCTCAGCACGCAGCTCTATTACATGGGCCGGTGGAAACTGGACTCGGGGATCTTCCGCCG 

CATCCTCCACGTGCTCTACACAGACTGCATCCGGCAGTGCAGCGGGCCGCTCT 

CCGCATGGTGCTGCTGGTC^TGGGCAACGTCATCAACTGGTCGCT 

TATCATGCGCCCCAATGATTTCGCTTCCTACTTGTTGGC^ 

CCTTTACTTCGCCTTCTACATCATCATGAAGCTCCGGAGTC 

CCCCCTGCTCTGCATCGTTTGCACCTCCGTGGT { 

CCAGGGACTCAGCACCTGGCAGAAAACCCCTGCAGAGTCX3AGGGAG 

CATCCTCCTCGA C T lVriT G^CGACCACGACATCTGGCACTTCCTCTCCTCCATCGCCA^ 

GTTCGGGTCCTTCCTGGTGTTGCTGACACTGGATGACC^ 

CGGGACAAGATCTATGTCTTCTAGCAGGAGCTGGGCCCTTCG 

TGAGCTCCTTTGTGTCATAGACCGGTCACTCTGTCGTGCTGTGGGGATGAGTCCCAGCAC 

CGCTGCCCAGCACTGGATGGCAGCAGGACAGCCAGGTCTAGCTTAGG^^ 

AGCCATGGGGTGGCATGG&ACCTTGCAGCTGCCCTCTGCCGAGGAGCAGGC 

TT3GAACCCCCAGATGCTGGCCAAATTGCTGCTTTCTTCT 

GCCCCTGTCCTTTGGCTCTCCATTTGTCCCTTTGCAAGAGGAAG^ 

TCCCCATTTCATGCCTITGCATTTTGCCCGTCCTCCT 

CTAAGGCCTCTTTTTCCTCCCATACTCCC^CT^ 

TGTCCTGTATCAGGGCCCCAGTTCTCTTTGGGCTGTCCCTGGCTGCCATCACTGCC 
CCAGTCAGCCAGGATGGATGGGGGTATGAGATTTTGGGGGTTGGC^ 
TTTTGGTGCTAAGGCCTGCAAGGGGCCTGGGGCAGTGCGTATTCTCTTC^ 
TGCTCAGGGCTGGCTCTTTAGCAATGCGCTCAGCCCAATTC 
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AAGAGGCTGAATTCAIaAGGTCACCTCTTCATCCCATCAGCTCCCAGACTGATGCCAGCAC 
CAGGACTGGAGGGAGAAGCGCCTCACCCCTTCCCTTCCr^ 
GCCAAACCCCAGCTGGTGGCXTTTCAGTGCCATTGAC^ 
AAAGGAGGGATGATACAGAGTTCAGCCCGTTCTG 

CCTCCA CAGCTGTGGGCACCCCAGTGCCTACCTTAGAAAGGGGCTTCAG 
CTGTTTCCCTCTACGTGCCCAGTCCTAGCCTCGCT 
TTTCCGTCCAGTCTTCAGGCAAGTTCTGTGTTAGTCATGCACACACAT^ 
TTGGAGTTTACAAAGAATTGCCCCAGCTCTGGGCACCCTGGCCAC 

CC CTTC GTCCCACCTGGTCCACCCCAGATGCTGAGGATGGGGGAGCTCAGGCGGGGCCTC 
TGCTTTGGGGATGGGAATCT G T n 'r i CTCCCAAA CTl^ 

GCTGGGAGATGAGGTCGGTCTGGATCTTTTCTCAGAGCGTCTCCATGCTATGGTTGCACT 
TCCGTTTTCTATGAATGAATTTGCATTCA^^ 
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Figure 116 

MFALGLPFLVLLVASVESHLGVLGPKNVSQKDAEFERTYVDEVN 
RTEGVRVSVNVU^KGLAPLIJ^VVROKEAWSFQVP 

TKNESEIQFFYVDVSTLSPVNTTYQLRVSRMDDFVLRTGEQFSFNTTAAQPOYFICyBFPE 
GVDSVIVKVTSNKAFPCSVTSIQDVLCPVYDI^NN^ 

NSFYVVVVVKTEDOACGGSLPFTPFAEDEPVDQGHRQKTLSVLVSQAVTSEAyVSGMLFC 
LGIFI^FYLLTVIXA<^NWRQK3CXTI^^ 

SFENVSGSTDGLVDSAC3TGDLSYGYQGRSFEPVGTRPRVDSMSSVEEDDYDTLTDIDSDK 
NVIRTKQYLYVADIARKDK^VLRKECYQIYFWNIATIAVFYALPW 

QDI CYYNFLCAHPLGNLS AFNNTLSNLGYI LLGLLFLL 1 1 LQRE I NHNRALLRNDLCALE 
CG I PIOTFGLFYAMGTALMMEGLLSACYHVCPNYTNFQFDTS FMYMIAGLCMLKLYQKRHP 
D I NASA YSA YACLAI VI FFSVLGWFGKGNTAFWI VFS I IHI I ATLLLS TQL YYMGRWKL 
DSGIFRRIIjn^YTDCIRQCSGPLYVDRMVLLVMGNVINWSIJ^YGLI^PNDFASYI^ 
IGICNL1XYFAFYIIMKLRSGERIKLIPLLCIVCTSVVWGFALFFFFQGLSTWQKTPAES 
R EHNRDCILLDFFDDHDI WHFLSS I AMFGSFLVLLTLDDDLDTVQRDKI YVF 
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Figure ! 17 

GA CTTTG CTTGAAT GTTT ACATTTTCTGCTCO 
ACGTTTTGTTAAAACTTTGGGGTGTCAGGAGTTGLAGCTTG 

ATGGCTAGGATGAGC l"l'ltalUATAGCAGCTTGCCAATTGGTGCTGGGCCTACTAATGACT 

CGTCCCTGGTTTACCCCACAGTGAACTTACAGAG^ 

CTCCGCTTAACAAGGATTCCCAGTAACCTCTCTAGTC^ 

AGCAATAACATCGCGAAGACTGTGGATCAGCTGCAGC^ 

GATTTCTCCCAAAACAACTTTACTAACATTAAGGAGGTCGGGCT 

CTCACAACGCTG CATTTGGAGGAAAATCAGATTACCGAGATGACTGATTA 

GACCTCAG CAAC CTTCAAGAACTCTACATCAAC CACAACCAAATTAGCACTATTTCTGCT 

CATGCTTTTGCAGGCTIAAAAAATCXATTAAGG 

GTTATTGATAGTCGCTGGTTTGATTCTACACCCAACCT 

AACCCTGTGATTGGAATTCTGGATATGAACTTCAAACC^ 

GTTTTGGCAGGAATGTATCTCACTGATATTCCTGGAAATGCTT^ 

CTTGAGAG CCTGTCTTTTTATGATAACAAACTGGTTAAAGTCCCTCAACTTGCCCTG CAA 

AAAGTTCCAAATTTGAAATTCTTAGACCTCAACAAAAACC^ 

GGGGACTTCAAAAATATGCTTCCXnTAAAAGAACTGGGAATCAA 

GTTTCTGTCGACCGCTATGCCCTGGATAACTTGCCTGAACTCACAAAGCTGGAAGCCACC 

AATAACCCTAAACTCTCTTACATCCACCGC^ 

AGCTTGATGCTGAACAACAATGCCTTGAATGCCATTTAC 

CCCAATCTGCGTGAGATCAGTATCCATAGCAATCCCCTCAGGTGTGACTXn'GTGATCCAC 

TGGATTAACTCCAACAAAACCAACATCCGCTTCATGGAGCCCCTGTCCATGTTCTGTGCC 

ATGCCGCCCGAATATAAAGGGCACCAGGTGAAGGAAGTTTTAATCCAGGA 

CAGTGC CTCCC AATGATATCTCACGACAGCTTCCCAAATCGTTTAAACGTGGATATCGGC 

ACGACGGTTTTCCTAGACTGTCGAGCCATGGCTGAGCCAGAACCTGA^ 

ACTCCCAT TGGA AATAAGATAACTGTGGAAACCCTTTCAGATAAATACAAGCT 

GAAGGTACCTTGGAAATATCTAACATACAAATTGAAGACTCAGGAAGATACACATGTC 

GCCCAGAATGTCCAAGGGGCAGACACTCGGGTGGCAACAATTAAGGTTAACGGGACCCCT 

CTGGATGGTACCCAGGTGCTAAAAATATACGTCAAGCAGACAGAATCCCATTCCATCTTA 

GTGTCCTGGAAAGTTAATTCCAATGTCATGACGTCAAACTTAAAATGGTCCT 

ATGAAGATTGATAACCCTCACATAACATATACTGCCAGGGTCCCAGTCGATGTCCATGAA 

TACAACCTAACGCATCTGCAGCCTTCCACAGATTATGAAGTGTGTCTCACAGTCT 

ATTCATCAGCAGACTCAAAAGTCATGCGTAAATGTCACAACCAAAAATCCCGCCTTCGtt 

GTGGACATCTCTGATGAAGAAACCAGTACAGCCCTTGCTC 

GCCGTCATTAGCCTTGCGTCCATTGCTGTGTACTTTGCCAAAAGATTTAAGAGAAA 

TACCACCACTCATTAAAAAAGTATATGCAAAAAACCTCTTCAATCCCACTAAATGAGCTG 

TACCCACCACTCATTAACCTCTGGGAAGGTGACAGCGAGAAAGACAAAGATGGTTCTGtt 

G AC^CC AAGCCAACCCAGGTCGACACATCCAGAAGCTATTACATGTGGTAACT 

TATTTTGCTTCTGGTAGTAAGGAGCACAAAGACGTTTTTG 

CAAGTTGAAGACTTTTGTATTTTTGACTTTG CTAGTTTGTGGCAGAGTGGAGAGGACGGG 
TGGATATTTCAAATTTTTTTAGTATAG CGTATCXSCAAGGGTTTGACACGGCTGCCAGCGA 
CTCTAGGCTTCC^ 

TTATTATTTTATTTTAGTTGTTGTGCTAAACTCAATAATGCTGTTC^ 

AATAAAATGATTAATGAttGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Figure 118 

MAiWSFVIAACQLVLGIiMTSLTESSI QNSECP^ 
DLRLTRIPSNI^SOTQVT^LQSNNIAJCTVDELQ^ 

TQLTTI»HLEENQI TEMTDYCLQDLSNLQELYI NHNQ1 STI SAHAFAGLKNLLRLHLNSN 
KLXVTDSRWFDSTPNLEII^IIGENPVIGILDMNFKP^ 

GLDSLESLSFYDNKLVKVPQLALQKVPNLKFLDLNKNPIHKI QEGDFKNMLRliKELGIK 
NMGELVSVDRYALDNLPELTKLEATNNPKLSYIHRIAFRSVP^ 

KTVESLPNLREI S IHSNPLRCDCVI HW INSNKTNIRFMEPLSMFCAMPPEYKGHQVKEV 
LIQDSSEQCLPMISHDSFPNRLOTDlGTlVKI^CRAMAEPEPEIYWVTPIGNKITVEn 

sdkyklss egtleisniqiedsgrytcvaq*tvt/^trvatikv^ 
kqteshsilvswkvnsnvmtsnijcwssa™ 

d yevcltvsnihqqtqkscvnvttknaafavdi sdqetstaiaavmgsmfavislas ia 
vyfakrfkrknyhhslkkymqktss i plnelyppl xnlwegdsekdkdgsadtkptqvd 
tsrsyymw 
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Figure ) 19 

CCCACGCGTCCGCCCACGCGTCCGGGTGCCACTCGCGCGCCGGCCGCGCTCCGGGLTlVr 
CTTTTCCCTCCGACGCGCCAOSGCTGCCCAGAC^^ 

CCCGAACCCCTCCGCGGAGAGGAGCGAGGCGGCGCCAGGGTGGCCCCCGGGGCGCGCTTG 

GTCTCGGAGAAGCGGGGACGAGGCCGGAGGATGAGCGACIX^GG 

CGCGAGTTGGGGCCGCGACTACCGGCAGCTGACAGCGCGATGAGCGACTCCCCA 

CCCTAGCCCGGTGTGCGCGCCAGGCGGMCGCGCAGGTGGGGCTGGGCT<nTAGTGGTCC 

GCCCCACGCGGGTCGCCGGCCGGCCCAGGATGGGCGCTGGCAACCCGGGCCCGCGCCCGC 

CGCTGCTACCCCTGCGCCCGCTGCGAGCCCGGCGTCCGGCCCGCGCCCTGCGCTCATGGA 

CGGCGGCTCCCGGCTGGCGGCGGCGCGCCCCCGGGCTGTGAATGCGACTCGCCCCTCGGC 

CGCGCTCCCCGCCCGCCCGCCCGCCGGGACGTGGTAGGGG 

ATCCCCAGCTCaCTGCGATGGCAGTTGGCGCGCTCT 

TGCCTGATGGTGGCTCTGTGCAGTCCGAGCATCCCGCTGGAGAAGCTGGCCCAGGCACCA 

GAGCAG CCGGG CCAGGAGAAGCGTGAGCACGCCA CTCGGGACGGCCCGGGGCGGGTGAAC 

GAGCTCGGGCGCCCGGCGAGGGACGAGGGCGGCA^CGGCCGGGACTGGAAGAGCAAGAGC 

GGCCGTGGGCTCGCCGGCCGTGAGCCtntSGAGCAAGCTGAAGCAGGCCTGGGTCTCCCAG 

GGCGGGGGCGCCAAGGCCGGGGATCTGCAGGTCCGGCCCCGCGGGGACACCCCGCAGGCG 

GAAGCCCTGGCCGCAGCCGCCCAGGACGCGATTGGCCCGGAACTCGCGCCCACGCCCGAG 

CCACCCGAGGAGTACGTGTACCCGGACTACCGTGGCAAGGGCTGCGTGGACGAGAGCGGC 

TTCGTGTACGCGATCGGGGAGAAGTTCGCGCCGGGCCCCTCGGCCTGCCCGTGCCTGTGC 

ACCGAGGAGGGGCCGCTGTGCGCGCAGCCCGAGTGCCCGAGGCTGCACCCGCGCTGCATC 

CACGTCGACACGAGCCAGTGCTGCCCGCAGTGCAAGGAGAGGAAGAACTACTC 

CGGGGCAAGACCTATCAGACTTTGGAGGAGTTCGTGGTGTCTCCATGCGAGAGGTGTCGC 

TGTGAAGCCAACGGTGAGGTG 

CTATGCACAGTGTCAGCGTGTCCCCAGACGGAGTGTGTGGACCCTGTGTACGAGCCTGAT 

CAGTGCTGTCCCATCTGCAAAAATGGTCCAAACTGCTTTGCAGARACCGCGGTGATCCCT 

GCTGGCAGAGAAGTGAAGACTGACGAGTGOVCCATATGCCACTGTACTTAT 

ACATGGAGAATCGAGCGGCAGGCCATGTGCACGAGACATGAATGCAGGCAAAT GTAGA CG 

CTTCCCAGAACACAAACTCTGACTTTTTCTAGAACATTTTACTGATC 

TGACTCrTGGGAACTATCAGTCAAAGAAGACTITTGATGAGGAATA^ 

TACTTTTCC I Tt 1'Lll'GATAACAGTTACTACAACAGAAGGAAATGGATATATTTCAAAAC 

ATCAACAAGAACTTTGGGCATAAAATCCTrCTCT^ 

GTACACAAAAGTACACTATTATATATCAAATGTATTTCT^ 

TTATATAAGTGTTTTCTATAGATGCAGATTAAAAaTGC^ 

AAAAAAAAAAAAAAAAAAAAA 
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Figure 120 

M PS STAMAVGALS SSLLVTCCLMVALCS PS I PLEKLAQAPEQPGQEKREHATRDGPGRVN 
EIrGRPARDEGG SGRDWKS KSGRGLAGRE PW S KIJCQAOTSQGGGAKAGDI^VRPRGDTPQA 
EALAAAAQZiAIGPEIiAPTPEPPEEWYPDYRGKXvCVDESGFVYAXGEXFAPGPSACPCLC 
TEEGPLCAQPECPRLHPRCIHVDTSQCCPQCKERKNYCEFRGKTYQTLEEFVVSPCERCR 
CEIANGEVLCTVSACPQTECVDPVYEPDQCCPICKNGPNCFAETAVIPAGREVKTDECT 
HCTYEEGTWRI ERQAMCTRHECRQM 
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Figure 121 

CGCGGAGCCCTGCGCTGGGAGGTGCA^ 
CCGCGCCCTGCGCCTTAACCAGGACTGCT^^ 

AAAG 

ATGG CTTTAAAAGTGCTACTAGAACAAGAGAAAACUl^LT^ 

GGCTATTTGTCATGTAAAGTGACTTGTGAATCAGGAGACTGTA 

GATCGGTCTGGAAACTGTGTTCCCTGCAACCAGTC 

GAATGTGGCTTCGGCTATGGGGAGGATGCACAGTGTGTGACGTGCCGGCT^ 

AAGGAGGACTGGGGCTTCCAGAAATGCAAGCCCTGTCTGGAC^ 

TTTCAGAAGGCAAATTGTTCAGCCACCAGTGATGCCATCTGCGGG 

ITlTATAGGAAGUlCGAAAL'rriaTCGGL i I l'LAAGACATGGAia HST u TGCCTTGTGGAGAC 

CCTCCTCCTCCTTACGAACCGCACTGTGCCAGCAAGGTCAACCTC^ 

ACGGCCTCCAGCCCACGGGA 

CACGGCGCTGGCTGCCGTTATCTGCAGCGCTCTGGCCACCGTCCTGCTGGC^ 

CCTCTGTGTCATCTATTGTAAGAGACAGTTTATGGAGAAGAAACCCAGCTGGTCTCT 

GTCGCAGGACATTCAGTACAACGGCTCTGAGCTGTCGTGTTTTGACA 

CGAATATGCCCACAGAGCCTGCTGCCAGTGCCGCCGTGACTCAGTGCAGACCTGCGGGCC 

GGTGCGCTTGCTCCCATCCAT G T G CTGTGAGGAGGCCTGCAGCCCCAACCCGGCGACTCT 

TGGTTGTGGGGTGCATTCTGCAGCCAGTCTTCAGGCAAGAAACGCAGGCCCAGCCGGGGA 

GATGGTGCCGA Cl w I T Cri 'CGGATCCCTCACGCAGTCCATCTGTGGCGA GlTlT CAGATGC 

CTGGC CTCTGATGCAGAATCCCATGGGTGGTGACAACATCrLTri'T PGTGACTCTTATCC 

TGAACTCACrGGAGAAGACATTCATTCTCTCAATCCAGAA 

GGATTCAAATAGCAGTCAAGATTTGGTTGGTGGGGCTGTTCCAGTC 

AAACTTTACAGCAGCTACTGATTTATCTAGATATAACAACACACTGCT 

AACTCAGGATGCACTAACTATGAGAAGCCAGCTAGATCAGGAGAGTGGCGCTGTCATC^ 

CCCAGCCACTCAGACGTCCCTCCAGGAAGCTTAAAGAACCT 

GCGTGTGCTGGAACCCAAAGAGTACTCCTTlOTTAGGCTTAlXKa^ 

CTTGCATGGCTTCTGGGGCAAAAATAAATCTGAACCAA^ 

CAGCCAGTTCCTTCTGAGCCAGACCAGCTCrrAAGCrGAAAC 

AGACTCCAGGCCGACTCATGATACTCTGCATCTTTCCTACATGAGAAGCTTCTCTGCCAC 

AAAAGTGACITCAAAGACTGATGGGTTGAGCTGGCAGCCT 

CAAGAAACAGAAATGCCCTCATGCTTATTTTCAT^ 

GACC CAGAGTATA C1"IV I TLTUT CCAGAAATAATTTCATAC CG CCTATGAAATATCAGAT 

AAATTACCTTAGCTTTTATGTAGAATGGGTTCAAAAGTGAGTGTT^ 

CACITTTTCATCATCTAAACTGATTCGCATAGGTGGTTAGAATGGCCCT 

CCTAAATCTTGGGTTTATTAGATGAAGTTTACTGAATCAGAGGAATCA^ 

TAGCTCTTTCCAGAATCCACACTITCTGACCTCAGCCTCGCT 

CTCAGGAGAACA.CCTGGGCTAGGGAATGTGGTCGAGAAAGGGCAGC 

TAACACATATTGTAGAGACTTGTATGCAAAGGTTGGCATATTT^ 

TATAGAAACATTT GT TGCATCTGTCCCTCTGCCTGAGCTTAGAAG 

TATTTATAAACATAAATGACCITTTACTTGCATTC 

TTACAACATATCAGGTTCCCCTACTACTGAAGTAGCCTTCCGTGAGAACACACCA 
TAGGACTAGAAGAAAATGGACAATTTGTAGGGGTTTGGATGAAGCA 
AGTGTAGTTTGACCAGGACATTGTCGTGCTCCTTCCAATTGTGTAAGATTAGTO 
TCATCTCCTAL"ri"l!AGCCATCCGGl^TrGGATTTAAGAGGACGGTGC TTCTTTC l'ATTAA 
AGTGCTCCATCCCCTACCATCTACACATTAGCATTGTCT 

ACCCCGTTCAGTCACAAAGCAGGGAATGGTTCATTTACTCTTAATCTTTATGCCCT 
AAGACCTACTTGAACAGGGCAT A ' l 1 ' 1 T ITA GACTTCTGAACATCAGTATGTTCGAGGGTA 
CTATGATATTTTGGTTTGGAATTGCCCTGCCCAAGTCACrGTCri^ 
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Figure 122 

MAIiKVUjEQEKTFFTLLVIXGYLSCKVTCESGD 
ECGFCTGEDAQC\TTCRIJ£RFm5WGFQKCK2^^ 

FYRKTKL VGFQDMECVP CGD P P P PY E PHCAS KVNLVKIAS TASS PRDTALAAVI CSALAT 
VLLALLILCVI YCKRQFMEKKPSWSLRSQDIQYNGSELSCFDRPQLHEYAHRACCQCRRD 
SVQTCGP VRLLPSMCCEEACS PNPATLGCGVHSAASLQARNAGPAGEMVPTFFGSLTQS I 
CGEFSDAWPLMQN"PMGGDNISFCDSYPELTGEDIHSLNPELESSTSLDSNSSQDLVGGAV 
PVQSHSEHFTAATDLSRYNNTLVESAS TQDALTMRSQLDQESGAVTHPATQTSLQEA 
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Figure 123 

GGGTGATTGAACTTUVACCTTCGCCGCACCG^ 

TGCCTGCTrGCGGTTGGAGAAATCAAGGCCCTACCGGGCCrc 

GTGG6CGTGGCCGAGGCC6GGGTGACCCTGCCGGAGCCTCCGCTGCCAGCGAC 

ATGTTCAAGGTAATTCAGAGGTCCGTGGGGCCAGCCAGCCTGAGCTTGC^ 

GTCTATGCAGCACCAAAAAAGGACTCACCTCCGAAAAAT^ 

TCACTCTACTCAGTTCCTGAGGGTCAATCGAAGTATGT^^ 

GAAGAAAGCATCTCACAGCTCCGACACTATTGC 

ACGTACTCCCAAACTAAGCCCAAGATGCAAAGTTTGGTTCAATGGGGGT^ 

GACTATCTCCAAAATGCACCTCCTGGATTTTTTCCGAGACTTGGTCnTATT 

GGCCTTATTGGACTCCTTTTGGCTAGAGGTTCAAAA^ 

GGTTTCATGGGATTAGCTGCCTCCCTCTATTATCCAC^ 

GTCAGTGGGGAGA^TTATATGACTGGGGTTTACGAGGATATAT^ 

TGGAAGGAGAACTTTCAAAAGCCAGGAAATGTGAAGAATTCACCT 

actccatgctctgccatcttaatcagttataggtaaacattggaa^ 

tcagtatttctacagaaaaatggcatagaagtcagtattgaatct^ 

ttcttcaggaaaaactagaccagacctctgttatcttctgtgaaat 

actaacctggaatcccttcacctagagataatgtacaagccttag 

tgttgctatttatgtacctaattaaaacccaagtttaaaaaaaaaaaaaaaaaa;^^ 

aaaaaaaaaaaaaaa 
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Figure 124 

MFKVIQRSVGPASLSLLTFKVYAAPKKDS PPKNSVKVDELSL YSVPEGQSKYVEEARSQL 
EESISQUWYCEPYTTWCQETYSQTKPKMQSLVQWGI^ 
GLIGLLLARGSKIKKLVYPPGFMGIJUSL^^ 
WKKNFQKPGNVKNS PGTK 
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Figure 125 

TTGAAAATCTACTCTATCAGCTGCTGTGGTTGCCAC 
ATGAAAGCCCTTATGCTGCTCACCCTGTCnSTTCTCCT 

GACATTCGCTGTCACTCCTGCTACAAGGTC CCTGTGCTGGGCTGTGTGGACCGGCAGTCC 
TGCCG CCTGGAGCCAGGACAGCAATGCCTGACA^^ 
GTTTTCTCCAATCTGCGCTGTGGCACACCAGAAGAGCCC^ 
ACCAACCGCAAGCTGGGTCTGAC^ 

AGCGCAGGACCCCGGCCCACTCCAGCCCTGGGCCTTGTCTrCCTTACCTCCTTGGCTGGC 
CTTGGCCTCTGGCTGCTGCACTGAGACTC^^ 

TTGGCCTGAGCCTCTCTCCCTGTGTCTCTGTATCrCCTGGCTTTACAGAATCGTCTCTCC 

CTAGCTCCCATTTCTTTAATTAAACACTGTTCCGAGTGGTCTCCT 

CCTCACACCCTTCACTCTCCTTTTTCTGGGTCCCTTCCCACTTC 

TGGCTCCTAGAAGGGCTCCCCACTTTGCTTCCTATACTCTGCTGTCC^ 

GGATTGGGATCTGGGCCTGAAATGGGGCTTCTGTGTTGTCCCCAG 

GGACCTGATGACCTCACTGTACAGAGCTGACTCCCCAAACCCAGGCTC 

CATCCCCCATACTCACCTCTTTCCATTTTGAGTAATAAATGTCTGAGTCTGGAAAA 
AAAAAAAAAA 
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Figure 126 

MKALMLLTLSVI^CWVSABIRCHSCYKVPV^ 

S^CGTPEEPCQEAFNQTTOIKLGLTYNTTC 

XfWLLB 
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Figure 127 

ACC 

ATCGATTGCCAAGAAAATGAGTACTGGGAC^^ 

TGTGGTCCTGGA»GGAGCXATCCAAGGATTGTGGTTATGGAGA^ 

TGCAOVGCCI^CCTCCTCGCAGGTACAAAAGCA 

TGCATCACCITn^CTGTCATCAATCGTGTTCAGAAGGTCAACT 

GCTGTCTGTGGGGACrGTTTGCCCAGGTTCTO 

GACCAAGAGTGCATCCOnX3CACGJU^G<^GACCCCCACCrC^ 

CAGTTGAGCTTAGTGGAGGCAGATGCACCCACAGTGCCCCCTCAGGAGGCCACA 

GCACTGGTGJUjCAGCCTGCTAGTGGTGTTTACCCTGGCCTTCCT 

TACTGCAAGCAGrrCTIXJAACAGACATTGCCAGCGTGTTACAGGAG 

GAGGCTGATAAAACAGCAAAGGAGGAATCTCrCTTCCCCGTGCC^ 

AGTGCTGAGTCCCAAGTGAGTGAGAA»TCTTTCAGArc 

GAGGACGACTGCAGCTCGACTAGTGGCTTCCCCACACAGGAGTCCTTTACCAT^ 

TGCACCrrC^GAGAGCCACTCCCACTGGGTCCACAGCCCC^ 

CTGCAAAAGTTTTCC^GCTCTGCCTCCTATACrGGA^ 

GTCGAAAGCACTGGAGACAGGCTGGAGCTCAATGTGCCCTTTGAAGTTCCCAGCCCCT 
GC 
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Figure 128 

MDCQENEYWDOWGRCVTCQRCGPGQELSKDCGYGEGGDAYCTACPPRRYKSSWGHHRCQ 
SCI TCAV1NRVQKVNCTATSNAVCGDCIJRFYRKTRIGGLQDQBCI PCTKQTPTS BVQC 
AFQI^LVEADAPTVPPQEATLVALVSSLLVVFTIAFLGL 

IiQPEADKTAKEESLFPVPPSKETSAESQVSENI FQTQPLNP ILEDDCSSTSGFPTQESF 

TMASCTSESHSHWVHSPIECTELDLQKFSSSASYTGAETI^^ 

VPSP 
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